Using cceHub components to build simple data exploration tools

By david michael grobe

Indiana University

Category

Seminars

Published on

Abstract

Several components developed as part of the cceHUB database project (namely com_dataview and com_form) were used to PROTOTYPE a new component supporting basic data exploration functions: the creation and display of frequency charts/tables, cross-tabulations, simple regressions, and correlation matrices.

These functions access individual tables already uploaded to databases accessible from the Hub; there is no provision at this time for uploading data that can then be explored, although tools for doing so have also been built using com_form, and could in principle serve that function.

This presentation would hope to identify interest among conference attendees to justify further development of the PROTOTYPE, since this software is not ready for deployment at this time.

For more information, including screen shots, see: http://mypage.iu.edu/~dgrobe/dataexplore/

Bio

Michael works as a Principal Systems Analyst in the Advanced Biomedical IT Core within Indiana University's Pervasive Technology Institute. He has a background and considerable experience in Web development, the construction and management of computer networks, and High Performance Computing. He came to the Pervasive Technology Institute to work on the Centralized Life Sciences Database, a large collection of genomics data, and has worked most recently on a variety of projects related to the Semantic Web and Indiana's CTSI Hub, including installation of the Purdue Database tools, definition of prototype tools for performing simple analyses (frequencies, cross-tabulations, correlation matrices) of data in DB tables, the manipulation of data about science researchers organized by the Vivo Consortium, software for searching the local Clinical Trials database, and is currently exploring the REDCap plugin facility.

Cite this work

Researchers should cite this work as follows:

  • david michael grobe (2012), "Using cceHub components to build simple data exploration tools," https://help.hubzero.org/resources/799.

    BibTex | EndNote

Using cceHub components to build simple data exploration tools
  • Using the Purdue DB Technology to build simple on-demand data exploration tools 1. Using the Purdue DB Technology… 0
    00:00/00:00
  • The general idea 2. The general idea 79.87987987987988
    00:00/00:00
  • The general idea (continued) 3. The general idea (continued) 248.9155822489156
    00:00/00:00
  • The prototype and a question 4. The prototype and a question 330.03003003003005
    00:00/00:00
  • A starting form to create queries on the fly. (http://dev1.indianactsi.org/form?proj_id=menu&form_id=form0) 5. A starting form to create quer… 398.09809809809809
    00:00/00:00
  • Request a frequency table/chart 6. Request a frequency table/char… 435.6022689356023
    00:00/00:00
  • The field name pulldown menu 7. The field name pulldown menu 444.21087754421092
    00:00/00:00
  • The resulting frequency chart 8. The resulting frequency chart 468.93560226893561
    00:00/00:00
  • The cumulative frequency chart 9. The cumulative frequency chart 490.55722389055722
    00:00/00:00
  • Request a chart of partitioned field values 10. Request a chart of partitioned… 501.40140140140142
    00:00/00:00
  • Resulting chart showing partition frequencies 11. Resulting chart showing partit… 519.35268601935275
    00:00/00:00
  • Cross tabulation on 10 partitions of 2 fields 12. Cross tabulation on 10 partiti… 524.52452452452451
    00:00/00:00
  • Cross tabulation (second half) 13. Cross tabulation (second half) 562.06206206206207
    00:00/00:00
  • Resulting tabulation (10 partitions of each variable) 14. Resulting tabulation (10 parti… 572.57257257257254
    00:00/00:00
  • User specified partitions 15. User specified partitions 585.48548548548547
    00:00/00:00
  • Resulting tabulation (user-defined partitions of each variable) 16. Resulting tabulation (user-def… 592.0587253920587
    00:00/00:00
  • Request a simple correlation/regression 17. Request a simple correlation/r… 608.04137470804142
    00:00/00:00
  • Resulting regression info 18. Resulting regression info 618.651985318652
    00:00/00:00
  • A correlation_matrix 19. A correlation_matrix 643.97731064397738
    00:00/00:00
  • Request user-defined displays via URLs 20. Request user-defined displays … 665.331998665332
    00:00/00:00
  • Summary 21. Summary 672.67267267267266
    00:00/00:00
  • Additional information 22. Additional information 688.15482148815488
    00:00/00:00
  • Copyright © 2022 Hubzero
  • Powered by Hubzero®