Skip to Main Content

Finding Datasets

This Guide provides information on how to search out and access pre-existing datasets.

Accessing Datasets Through the TCNJ Library Website

You can find datasets at the Library's Website! This slideshow will show you how. First, search your topic in the search bar on the front page of the website.

(Click the rightward-facing arrow or click and drag the picture to move to the next step).

The Gitenstein Library website's home screen, displaying its central search bar. "Household income" is typed in the search bar as an example search.

You can filter your results using the options on the left sidebar.

The results of a search for "household income" in the Gitenstein Library's website. The left sidebar, titled "Tweak Your Results", contains filters.

To filter your search to just datasets, click the "datasets" option within the "Resource Type" section (you may have to hit "show more" to get "datasets" to appear) then click "apply filters".

The results of a search for "household income" in the Gitenstein Library's website. In the "Resource Type" subsection of the "Tweak your Results" left sidebar, a checkbox next to the "datasets" filter is checked. At the bottom of the sidebar is a green button, labeled "Apply Filters".

Hitting "apply filters" should re-load your search results with a list of datasets available through the library website.

The results of a search for "household income" in the Gitenstein Library's website. Filtering by datasets causes the "Active Filters" section of the left sidebar to display "datasets". Users know each search result is a dataset because each results is labeled "dataset".

Other helpful filters include filtering by subject (so you can make sure your results are relevant to your topic), and...

The results of a search for "household income" in the Gitenstein Library's website. In the "Subject" subsection of the "Tweak your results" section, checkboxes next to two subjects are checked. At the bottom of the sidebar is an "Apply filter" button.

filtering by year (so you can make sure your search results are relevant to the time period of your topic)

The results of a search for "household income" in the Gitenstein Library's website. The "Year" subsection of the "Tweak your results" sidebar contains two fields which allow users to input starting and ending years. A checkbox next to those two fields is checked. At the bottom of the sidebar is an "Apply Filters" button.

Common Sources of Datasets

Common Sources of Datasets

  • Government agencies (e.g., data.gov - source for all the US government's open data)
    • Generally free.
    • Data format varies widely, so double check it's compatible with your statistical software
    • Collected through research to help aid policy decisions
    • Also collected to help support the functioning of government work (such as the Food and Drug administration counting the number of adverse reactions to a particular drug, or the Occupational Safety and Health Administration counting workplace injuries).
    • Accessible Via: official government websites, TCNJ's Library Website, TCNJ's Library databases, dataset repositories.
  • Non-profits or non-governmental organizations (e.g., UNData - source for statistics compiled by the UN)
  • Dataset databases (aka "dataset repositories"; e.g., the Harvard Dataverse - a collection of datasets from across disciplines)
    • Usually subscription/fee based
    • Data generally available in formats like SPSS, SAS, Stata or Excel
    • Accessible Via: dataset repositories.

Some of the content in this box was adapted with permission from the University of Michigan Library's Finding Data LibGuide

Dataset Collections

Non-specialized

Subject-specific

Statistics Collections

Helpful TCNJ Libguides

TCNJ Librarians have already collected and annotated many sources for data and statistics. The guides below will contain links to resources, as well as short summaries of what those resources include.

Evaluating Datasets

Found datasets using the resources above? Click here for suggestions about how to evaluate them.