Suggested Citation:
"Items for Ongoing Consideration." National Research Council. 1996. Massive Data Sets: Proceedings of a Workshop. Washington, DC: The National Academies Press.
doi: 10.17226/5505.
Items for Ongoing Consideration
Data Preparation
Elevation of status of data preparation and data quality stages in professional societies
Clear articulation of what is meant by a massive data set
Development of rigorous, theory-based methods for reduction of dimensionality
Systematic study of how, when, and why methods used with small and medium-sized data sets break down with large size data sets; understanding of how far current methods, both statistical and computational, can be pushed; articulation of the variety of models that might be useful
Development of methods for integration of tools and techniques
Development of specialized tools in general "packages" for non-standard (e.g., sensor-based) data
Establishment of better links between statistics and computer science
Exploration of the use of "infinite" data sets to stimulate methods for massive data sets
Creation of richer language for describing structure in data
Educational opportunities—for nonstatisticians who use some statistical techniques and for statisticians, to broaden the knowledge base and provide better links to computer science
Models and Data Presentation Research Issues
Discovery and comparison of homogeneous groups
Communication and display of variability and bias in models
Better design of hierarchical visual display
New modeling metaphors and richer class of presentation approaches
Methods to help "generalize" and "match" local models (e.g., automated agents)
Robust or multiple models; sequential and dynamic models
Suggested Citation:
"Items for Ongoing Consideration." National Research Council. 1996. Massive Data Sets: Proceedings of a Workshop. Washington, DC: The National Academies Press.
doi: 10.17226/5505.
Alternatives to internal cross-validation for model verification
Retooling of computing environment for modeling massive data sets
Simple presentation of ''massive'' complex data analyses
Suggested Citation:
"Items for Ongoing Consideration." National Research Council. 1996. Massive Data Sets: Proceedings of a Workshop. Washington, DC: The National Academies Press.
doi: 10.17226/5505.
Suggested Citation:
"Items for Ongoing Consideration." National Research Council. 1996. Massive Data Sets: Proceedings of a Workshop. Washington, DC: The National Academies Press.
doi: 10.17226/5505.
Sign in to access your saved publications, downloads, and email
preferences.
Former MyNAP users: You'll need to reset your password on your first
login to MyAcademies. Click "Forgot password" below to receive a reset
link via email. Having trouble?
Visit our FAQ page
to contact support.
Members of the National Academy of Sciences, National Academy of
Engineering, or National Academy of Medicine should log in through their
respective Academy portals.
Register
Register
Download as a Guest
Download as a Guest
While logged on as a guest, you can download any of our free PDFs on
nationalacademies.org
. You will remain logged in until you close your browser.
Thank You
Thank You
Thank you for creating a MyAcademies account!
Enjoy free access to thousands of National Academies' publications, a
10% discount off every purchase, and build your personal library.
Forgot Password
Forgot Password
Enter the email address for your MyAcademies (formerly MyNAP) account to
receive password reset instructions.
Reset Requested
Reset Requested
We sent password reset instructions to
your email
. Follow the link in that email to create a new password. Didn't receive
it? Check your spam folder or
contact us
for assistance.
We sent a verification link to your email. Please check your inbox (and
spam folder) and follow the link to verify your email address. If you
did not receive the email, you can request a new verification link below