Data Links

 

The links below are either for data collections or pages like this one with links to more data. Quoted material is taken from the linked page, while the remaining remarks are my observations.

 

 General/Multiple

Data and Story Library DASL (pronounced "dazzle") is an online library of datafiles and stories that illustrate the use of basic statistics methods. We hope to provide data from a wide variety of topics so that statistics teachers can find real-world examples that will be interesting to their students. Use DASL's powerful search engine to locate the story or datafile of interest.

Guide to the Web for Statisticians: Data Sets From the University of Queensland, links to many data sets.

Journal of Statistics Education Data Archive Data sets with clear documentation and, for many, article citations of papers that analyzed the data.

Statistics and Statistical Graphics Resources Maintained by Michael Friendly, Statistical Consulting Service and Psychology Department, York University, "This page provides an annotated, topic-based collection of available resources for statistics, statistical graphics, and computation related to research, data analysis and teaching, now containing over 580 links."

"Fishing for Data using the NET" Homepage Designed by Leon L. Bourn and Stephen J. Turner, site maintenance and research by Julia N. Phinyawatana, has categorized links to multiple data sites.

Electronic Dataset Service Links to datasets organized by method. Office of Information Technologies From "Academic Computing, A125 Lederle Graduate Research Center (lowrise) University of Massachusetts Amherst."

Statlib JASA Data Archive "The jasadata archive contains contributed datasets from articles published in the Journal of the American Statistical Association."

Statlib Links to multiple data sets including above.

Dr. B's Wide World of Web Data-From ASU Research Methods " Overview. This archive is a set of links to data and depictions of data from throughout the world. It is organized by topic areas. We hope instructors will use these data for examples in classes AND set students loose to find data that THEY find interesting." Copyright © 1995 John Behrens.

EDINA® Edinburgh Data and Information Access "providing national online services for the UK tertiary education and research community."

University of Edinburgh Data Library "Sources of Data on the Internet (by category)"--Links to European and some international data sets.

Statistics Resources: Data Resources A choice from among " a comprehensive guide to the most commonly-used statistics resources on the Web." from cti, Statistics Department of the University of Glasgow.

Robin Lock's Homepage Good links to data sets and instructional resources.

San Diego SourceBook Various data sets, usually in tabular form, some with graphical representations.

Statistics Canada Data sets plus additional links.

Chance Data Sets Data sets "to help students understand issues that may not be found in a standard statistics text."

Data and Statistical Services Princeton University " The Data Library is a collection of over 24,000 machine-readable files. Electronic data is gathered primarily for the social sciences but also for the sciences and humanities. Data are available for the United States and foreign countries. The Data Library includes extensive holdings for Economic Time Series and U.S. Census Data, and a range of other subject areas spanning historical and contemporary events, factual and opinion-oriented studies, and single and multiple country coverage."

OzDASL " OzDASL is a library of data sets and associated stories. It is intended as a resource for teachers of statistics in Australia and New Zealand, and emphasis is given to data sets with an Australasian context. The collection is still growing. Interesting contributions are welcomed at any time. Data sets are grouped by the most obvious applicable methodology."

UK National Statistics website "This site contains the latest comprehensive range of official UK statistics and information about statistics as well as providing free access to a selection of recently released publications in downloadable pdf format."

Gallup National polling results.

SADA "The South African Data Archive (SADA) serves as a broker between a range of data providers (e.g. statistical agencies, government departments, opinion and market research companies and academic institutions) and the research community. The archive does not only preserve data for future use, but also adds value to the collections. It safeguards datasets and related documentation and attempts to make it as easily accessible as possible for research and educational purposes."

UCLA Statistics Dept. Some data links.

Exploring data " Students should work with real data. So here is some, available in tab-delimited, Excel 4.0 and NCSS 6.0 Jr formats."

EDS DataGate Links to the Columbia University Electronic Data Service Data Gate and other data sets.

 Sanders & Smidt Data sets from "A First Course in Statistics"

 

Agriculture

Agriculture Statistics Of People's Republic Of China: 1949-1990 This dataset was provided by United States Department of Agriculture, with the original data coming from the State Statistics Bureau of P. R. China.. The report compiles much of the important agricultural data that China's State Statistical Bureau (SSB) reported for 1949-90. A total of 297 variables are provided in 10 data groups, which include land use, population, labor force, gross value of agricultural output, agricultural investment, crop production, crop sown area, state procurement, livestock inventory and slaughter, animal product output, input production and use, costs of production, consumption of agricultural commodities, selected retail price indices and mixed average procurement prices for selected agricultural crops, and quantity and value of imports and exports of selected commodities. Data are aggregated at the national level and the provincial level where available.

Food and Agriculture Organization of the United Nations "FAOSTAT is an on-line and multilingual databases currently containing over 1 million time-series records covering international statistics in the following areas: Production, Trade, Food Balance Sheets, Fertilizer and Pesticides, Land Use and Irrigation, Forest Products, Fishery Products, Population, Agricultural Machinery, Food Aid Shipments.

Agricultural Statistics From USDA - National Agricultural Statistics Service "Agricultural Statistics is published each year to meet the diverse need for a reliable reference book on agricultural production, supplies, consumption, facilities, costs, and returns."

U.S. Department of Agriculture National Agricultural Statistics Service "Several links to ag data.

USDA Economics and Statistics System "The USDA Economics and Statistics System contains nearly 300 reports and datasets from the economics agencies of the U.S. Department of Agriculture. These materials cover U.S. and international agriculture and related topics. Most reports are text files that contain time-sensitive information. Most data sets are in spreadsheet format and include time-series data that are updated yearly."

ERS Economic Research Service, US Dept of Agriculture. " ERS produces a range of data products available in different formats, including online databases, spreadsheets, and web files. You can search by topic, by title, or by date. All data products online are available at no charge

 

Business/Economics

Bureau of Labor Statistics Data Variety of data sets. FedStats: Data Toolkit "The gateway to statistics from over 100 U.S. Federal agencies."

Time Series Data Library This is a collection of over 500 time series, maintained by Rob Hyndman, Director of Consulting and Associate Professor in the Department of Econometrics and Business Statistics at Monash University.

Economic Time Series Page "Browse Data Collections Over 100,000 data files, with charts and excel files for each."

NIPA National income and product accounts (NIPA) show the value and composition of the Nation's output and the distribution of incomes generated in its production. The accounts include estimates of gross domestic product(GDP) - the market value of the Nation' s output of goods and services - in current and real terms, GDP price measures, the goods and services that make up GDP in current and real terms, national income, personal income, and corporate profits. In addition, BEA produces specialized measures such as estimates of auto and truck output, GDP of corporate business, housing output, and business inventory and sales."

BEA Bureau of Economic Analysis, U.S. Dept. of Commerce

 

Crime and law Enforcement

Office of Justice Programs Large justice/crime data sets with SAS and SPSS data definition files.

U.S. Department of Justice, Bureau of Justice Statistics BJS home page, Data for analysis Online tabulations, datasets & codebooks

The Sourcebook of Criminal Justice Statistics "brings together data from more than 100 sources about all aspects of criminal justice in the United States, which are presented in over 600 tables."

Uniform Crime Reports U.S. Department of Justice, Federal Bureau of Investigation, Criminal Justice Information Services (CJIS) Division Uniform Crime Reports. Many of these reports are in PDF format. To view those documents you will need Adobe Acrobat Reader.

 

Demographics

Government Information Sharing Project-Oregon State University A variety of demographic, economic, and educationally related data sets.

US Census Bureau A wee bit of data.

PaSDC Pennsylvania State Data Center. "The PaSDC is the Commonwealth's official source of demographic and economic data."

Statistical Abstract of the United States From the US Census Bureau, publications with data in pdf format.

 

Education

National Center for Education Statistics, Encyclopedia of Education Stats "The Encyclopedia of ED Stats brings together data from several NCES sources including: The Condition of Education, The Digest of Education Statistics, and Projections of Education Statistics. You will be able to find information in these compendiums by searching documents through their table of contents (see below), by subject area, and through full text and table title word searches."

 National Center for Education Statistic Quick Tables and Figures Search engine for many data sets.

 

Engineering/Science

The Statistical Reference Datasets Project Developed by staff of the Statistical Engineering Division and the Mathematical and Computational Sciences Division within the Information Technology Laboratory of the National Institute of Standards and Technology. " The purpose of this project is to improve the accuracy of statistical software by providing reference datasets with certified computational results that enable the objective evaluation of statistical software."

Energy Information Administration "Official Energy Statistics from the U.S. Government."

WDC The World Data Center for Marine Geology & Geophysics, Boulder. "promotes excellence in archiving, managing, and exchanging data obtained from measurements of the seafloor."

SESTAT "A comprehensive and integrated system of information about the employment, educational and demographic characteristics of scientists and engineers in the United States."

 

Environment/Natural Resources

EPA Databases and Software Links to EPA databases.

Long Term Ecological Research Site Data from Oregon State's H.J. Andrews Experimental Forest. "Our data sets consist of long-term research study data with accompanying documentation (metadata). Spatial and non-spatial data sets are available as well as software and models.

NERR System Wide-Monitoring Program NERR = National Estuarine Research Reserve. "The goal for the System-wide Monitoring Program is 'to identify and track short-term variability and long-term changes in the integrity and biodiversity of representative estuarine ecosystems and coastal watersheds for the purposes of contributing to effective national, regional, and site specific coastal zone management'."

Estuary-Net Volunteer Monitoring Data Data Directory for individual estuaries provided by volunteers.

NOAA Paleoclimatology Program "the NOAA Paleoclimatology Program at the National Geophysical Data Center, a central location for paleoclimate data, research, and education. NOAA Paleoclimatology helps the World share scientific data and information related to climate system variability and predictability."

OTAG/AQA Data sets from the Air Quality Analysis workgroup of the Ozone Transport Assessment Group.

Nebraska Data Bank "The Natural Resources Data Bank was statutorily created in 1969 and is administered by the Department of Natural Resources (DNR). The purpose of the Data Bank is to develop, store, process and manage natural resources data relating to soil and water resources of the State, and make the information available to government agencies, academia, the private sector and the general public in a user-friendly and timely manner."

The Data Zoo The Data Zoo at CCS (Center for Coastal Studies, Scripps Institution for Oceanography) contains data collected by various California coastal data collection programs and studies. The Data Zoo is brought to you by funding from the Minerals Management Service (MMS).

U.S. Geological Survey A variety of geological data sets.

La Graciosa Thistle MINITAB Worksheet. This is data collected byCal Poly bio grad student Mary Lea. It was collected on the La Graciosa Thistle in the Guadalupe dunes. Predictors: total diameter = the sum of the diameters of all the singleton flower heads on a plant; Multidiameter = the sum of the diameters of all the multiple flower heads on a plant; Viable Heads = the number of heads on the plant which appear to contain fertilized seeds; Actual Heads = the total number of heads on the plant including those which don't appear to be fertilized. Response: Viable achenes = the number of seeds which appear to be fertilized. Supplied by Andrew Schaffner, Cal Poly Statistics Department.

 

Health/Medicine

Oregon Health Division: Health Statistics & Information Various health related data sets.

American Heart Association Statistics " The American Heart Association is the leading authority on heart and blood vessel diseases. Statistical information on specific heart and blood vessel diseases can be found throughout this Web site.

AskC/Net U.S. Cancer Data

WHO Statistical Information System (WHOSIS) Health and health-related statistical information from the WHO Global Programme on Evidence for Health Policy.

CDC Center for Disease Control and Prevention, Data and Statistics

National Center for Health Statistics U.S. Department Of Health And Human Services Centers for Disease Control and Prevention.

 

Social Science/Humanities

Social Science Data on the Internet U of Ca., San Diego" Data on the Net Search or browse our listing of 851 Internet sites of numeric Social Science statistical data, data catalogs, data libraries, social science gateways, addresses and more.

Statistics and Social Science Group at NYU Links to many data sets, not all social science oriented.

The White House Social Sciences Briefing Room "The purpose of this service is to provide easy access to current Federal social statistics. It provides links to information produced by a number of Federal agencies. All of the information included in the Social Statistics Briefing Room is maintained and updated by the statistical units of those agencies. All the estimates for the indicators presented in the Federal Statistics Briefing Rooms are the most currently available values."

Data Resource for Sociologists " ASA is pleased to provide this on-line module on Data Resources for sociologists."

Social Science & Government Data Library U of C, Berkeley-- 1990 Census of Population and Housing Subject Summary Tape Files

Data Library Service University of Toronto " The collections of the DLS consist of numeric, spatial and textual research data files, primarily but not exclusively in the social sciences. These files contain quantitative research data, including microdata, aggregate data and time-series databases."

CESSDA Council Of European Social Science Data Archives. An amazing number of international data links--just click on the map. "Welcome to the CESSDA (Council of European Social Science DataArchives) home pages. CESSDA promotes the acquisition, archiving and distribution of electronic data for social science teaching and research in Europe. It encourages the exchange of data and technology and fosters the development of new organisations in sympathy with its aims. It associates and cooperates with other international organisations sharing similar objectives."

ARDA The American Religion Data Archive is located in the Department of Sociology at The Pennsylvania State University At Penn State. "The American Religion Data Archive collects quantitative data sets for the study of American religion"

UK Data Archive "The UK Data Archive is a specialist national resource containing the largest collection of accessible computer readable data in the social sciences and humanities in the United Kingdom. Through these web pages it is also possible to search the catalogues of other national archives for computer readable data and to use the services of the Data Archive to acquire these data on your behalf."

ICPSR " Massachusetts Federation ICPSR Data Site. This site provides the research community with direct and easy access to social science data files. From this site, you may search, browse and/or download data from hundreds of SPSS Export files from the local archive." "Established in 1962, the Inter-university Consortium for Political and Social Research (ICPSR) housed at the University of Michigan is a membership-based organization providing access to the world's largest archive of computer-based research and instructional data for the social sciences."

NES "The mission of the National Election Studies (NES) is to produce high quality data on voting, public opinion, and political participation that serve the research needs of social scientists, teachers, students, policy makers and journalists concerned with the theoretical and empirical foundations of mass politics in a democratic society."

SOSIG Social Science Information Gateway. "The Social Science Information Gateway (SOSIG) aims to provide a trusted source of selected, high quality Internet information for researchers and practitioners in the social sciences, business and law. It is part of the UK Resource Discovery Network."

 

Transportation

Insurance Institute for Highway Safety Occasional data sets.

The Century Council "Funded by America's leading distillers," has some interesting data sets, including drunk driving fatalities (Wyoming was more than 3.5 % higher than any other state).

The Bureau of Transportation Statistics, US DOT Links to data on transportation.