DataFerrett.census.gov link

DataFerrett: for the TheDataWeb  

A collaboration between the U.S. Census Bureau and the Centers for Disease Control     
 INSIDE TheDataWeb:

 DataFerrett Home

 What is DataFerrett

 TheDataWeb Home

 What is TheDataWeb

 TheDataWeb Browser:
 DataFerrett


 Datasets Available

 TheDataWeb Services

 TheDataWeb Publisher
 & Server Setup


 FAQ

 TheDataWeb HelpDesk:
 Toll Free: 866-437-0171

 DataFerrettTeam Email:
 dsd_ferrett@census.gov

 Use our Online Form
 for Comments,
 Questions, or Errors

User's Guide

Part 1 - Getting Started
Chapter 01 - History of the DataFerrett
Chapter 02 - DataFerrett at a Glance
Chapter 03 - System Requirements
Chapter 04 - Major Functions of DataFerrett

Part 2 - Microdata
Chapter 05 - Overview of Major Functions
Chapter 06 - Step 1: Select Survey & Variables Tab
Chapter 07 - Browse Variables Window
Chapter 08 - Step 2: The DataBasket Tab
Chapter 09 - Recode Window
Chapter 10 - Step 3: Download / Make a Table
Chapter 11 - Using the Geography Wizard
Chapter 12 - Make a Table
Chapter 13 - Make a Graph
Chapter 14 - Make a Map
Chapter 15 - Microdata vs. Aggregate Data
Chapter 16 - Application vs. Applet
Special Instructions for Extractions for SIPP Users

Chapter 1 - History of the DataFerrett

In 1997 the U.S. Census Bureau and the Bureau of Labor Statistics released the first version of a Federated Electronic Research, Review, Extraction and Tabulation Tool (FERRETT). This was a generalized search system for extracting and tabulating data across heterogeneous statistical data sources.

The current version of DataFerrett as an application aims to further enhance on-line statistical analysis and attract participation and statistical agencies at all levels of state, federal and local governments.

DataFerrett's Release Notes will give an overview of all the bugs, enhancements, and changes that have been made to the DataFerrett.

top

Chapter 2 - DataFerrett at a Glance

Choose DataFerrett Configuration Click the Windows Start button > Programs > DataFerrett Application > DataFerrett Configuration; The " Current Location: " indicates in bold letters which DataFerrett location is serving as your access point. The standard/normal location should point to Census Public [Bowie, MD].









After installing the DataFerrett Application to your desktop, Launch the DataFerrett Application by clicking on the DataFerrett icon.

Log In

  • The first step is to log in to the system. You will put in your exact email address which is required for extracting data and having it sent to your computer. Please do not insert false or incorrect email addresses to login to DataFerrett.
  • You will have to be connected to the internet to access data.
  • Certain types of users may have to obtain a password to access special data. They will have to uncheck the Public Use box and enter their password.

    Start

    The Start Tab is the screen you arrive at after you log in. The Start Tab is an information depot and a cover page for the DataFerrett Application.

    The Start Tab has several images on it:

    • You can select the rectangle DataFerrett image to go to the website for The DataFerrett.
    • You can select the oval TheDataWeb image to go to TheDataWeb website. The differences between the two are explained on either website which are connected.
    • You can select the Release Notes that gives information about the version, enhancements and bug fixes the DataFerrett Team has made to the application.
    • To begin looking for data, you can select either the "Step 1: Search DataSets by Topics or Themes" or the Microdata Tab (next to the Start Tab) to begin looking for and working with the registered data. These two options go to the data Selection screen.

    Microdata Tab

    The Microdata Tab screen contains three more tabs that display the Steps for completing a DataFerrett Session. This tab also lists datasets and surveys available and the searching options available for DataFerrett.

    Step 1: Select Dataset & Variables

    This screen allows you to search through the sponsoring agencies and surveys that have been registered with DataFerrett on TheDataWeb.

    Continue clicking on the plus sign(s) until you see a name/year with a bullet. Double click on a bulleted dataset and a Sponsor/Topics and Themes window will pop up. You can check as many boxes as you need to return the variables under that survey.

    If you have searched for variables by labels, names, themes, or descriptions, after selecting the dataset, results will be listed on the right.

    Select as many of the returned variables by using the shift or control key and then select the Browse / Select Highlighted Variables button to open the code book.

    Or, double clicking on an individual variable will open the " Ferrett Topics > window that will show topics for the selection. Check boxes on a topic or topics and select OK. This will return the variables that correspond to the topics selected. You will then highlight the variables you are interested in and select the " Browse / Select Highlighted Variables. " This will take you to the " Ferrett Browse Variable " window for actual selection and inclusion into your Data Shopping Basket.

    Ferrett Browse Variable Window

    You will be able to select all or specific values of the variable(s) to be included in your Data Shopping Basket

    You will receive a message each time you select OK as to the number of variables you have added to your DataFerrett Shopping Basket.

    Confirmation of Number of Variables added to DataBasket


    Empty DataBasket Step 1: The Select Dataset(s) & Variables screen gives you the option at any time to start over by emptying your DataBasket by pressing the Empty DataBasket button.

    The "I" buttonInformation Button gives instructions for the screen and how to perform the screen's main functions.

    Step 2: Data Shopping Basket

    The Step 2: Data Shopping Basket tab has several functions:

  • Recode Variables
  • Modify Variables
  • Delete Variables
  • Advance SQL Option
  • Change Longitudinal Period
  • Save Selected Variable(s) CodeBook

    These will be discussed at length in Chapter 8 of this Users' Guide

    Step 3: Download/Make a Table

    The Step 3: Download/Make a Table tab has two main functions:

    Download Data and Make a Table

    Download Data

    Downloading or extracting data has several format options:

    Download Data Options

    • SAS
    • SPSS
    • Excel/Access
    • Space Delimited
    • Comma Delimited

      These can be downloaded or extracted with:

    • No compression
    • DOS (PKZIP)
    • GNU (GZIP)

    You have the option of your downloaded data or extract being returned in batch mode or not. Batch is usually selected for large numbers of values and variables. This is one of the critical issues for logging into DataFerrett with your correct E-mail address.

    Make a Table

    Open Make a Table/Spreadsheet Option

    The Make a Table button gives you one of the most powerful features of the DataFerrett. The variables in your Data Shopping Basket appear to the right of the table grid.

    DataFerrett Tabulation/Spreadsheet Screent

    Drag and drop the variables in the right hand list over to the spreadsheet and click the GO button to get numbers back.

    Drag variables to spreadsheet animation

    Above the table grid are several additional functions:

    spreadsheet button options

  • Information
  • GO button (to actually retrieve the values (numbers)
  • Make a Graph (Gives several types of graphs: line, bar, stacked, etc.)
  • Map (Selected values)
  • TimeSeries graph
  • Clear table grid (to start over)
  • Sort (Ascending / Descending / Unsort)
  • Full view of grid (covers up the variables listed to the right)
  • Examine Underlying Records (depends on the submitted data)
  • Calculations are inserted under this "toolbar" and uses the Rows and/or Columns number designations in the formulas.

    All the individual Make a Table functions will be discussed further in Chapter 12 .

    Save the DataFerrett Session

    Save As drop down menu

    On all pages the drop down menus at the top of the screen(s) will allow you to save your DataFerrett Session DataBasket. In the Table screen you will be able to save your Table layout.

    Make a Graph

    Selecting cells of your table will allow you to create a graph of your data, with title and exporting capabilities.

    Graph image

    Make a Map

    With highlighted values within certain datasets containing geography variables you are able to create a map.

    Map image

    Help will direct you to a Getting Started Quick Tour,  Help Request   to email for support, the Known Bugs found within DataFerrett, Release Notes for updates and upgrade information, and the Version of DataFerrett you are running.

  • top

    Chapter 3 - System Requirements

    Computer

    • IBM PC or 100% compatible computer with a Pentium 100Mhz processor. Pentium 200 Mhz or higher is recommended.
    • 32MB of RAM. 64MB or higher is recommended
    • compatible mouse
    • Windows 95 or higher

    Drives

    • 11MB available disk space.
    • (Optional) 4XCDROM for installing the tutorial and Java plug-in from the DataFerrett cd (13.8MB)

    Monitor

    • SVGA monitor with a minimum setting of 800 x 600

    Connectivity

    • 28.8KBps internet connection (56KBps is recommended for comfortable use)

    Email Login

    The email login serves several purposes. Should you wish to be on an email list with the latest DataFerrett updates, the information will be forwarded to your email account. Also, the email login provides you the option of performing a DataFerrett job (containing large amounts of data) in batch mode that can be emailed to you upon completion. Upon completion of the job, you will be emailed with information about where you can obtain your file.

    top

    Chapter 4 - Major Functions of DataFerrett

    Microdata sources are collections of individual responses to questions asked once in a particular survey or census, or in a series of survey questions asked over time. The survey questionnaires may be completed directly by respondents or the responses may be derived from interviews during which questions are read and responses recorded by an interviewer. The information can also be answers to questions on an administrative form. To use microdata sources, you need to choose what responses, or variables, are available (for example, age and income). You will also want to define which population of respondents you will want to study (for example, employed women) according to what the possible answers were to the questions and how they are recorded (values). Values are the possible responses for each question or section of a form.

    For example, the form may have asked about motor vehicle ownership, with the responses being 1=Car, 2=Van, 3=Truck, 4=Other. You will need to know how the answers (values) are coded and think about how you might want to add categories together or compare categories of answers. Values may be ranges of numbers, amounts (e.g. dollars or ounces), or codes that represent certain categories (e.g. age ranges or number of years attended school).

    Values may be categorical, integer, or alpha/character. Responses can be summarized for specific topics to create specific views of the data. Do-it-yourself tabulations can be produced for any desired set of variables to study the characteristics of specially defined populations.

    This part of DataFerrett is what allows you to select specific items from surveys, specific values for those items, and then download the data or create your own custom tables and graphs.

    Microdata is data in which every record is at the unit of analysis level and all records must be added up to get the totals for each data item. For example, for surveys of individuals, microdata contain records for each individual interviewed; for surveys of organizations, the microdata contain records for each organization.

    Aggregate data is data which has already been summarized or added up, usually for specific geographical units or some other unit, such as industry classifications. In this case, each record is a geographical unit and there is no summing needed to get the totals for the geographies.

    Time Series

    When this part of DataFerrett becomes available, it will enable the user to obtain and analyze time series data from diverse sources. Specifically, it will support (1) searching for time series with user-specified criteria, such as geography, a demographic characteristic and/or industry, (2) reviewing detailed descriptions of the time series resulting from the search, and (3) based on the review, specifying particular time series to download for graphical and/or spreadsheet analysis.

    top

    Chapter 5 - Overview of Major Functions

    The Microdata part of DataFerrett consists of the following three tabs: Step 1: Select Dataset & Variables, Step 2: Data Shopping Basket, and Step 3: Download/Make a Table.  The functionality of these tabs will be fully explained in the following chapters.  Below is a brief overview of what these tabs contain.

    Step 1: Select Dataset & Variables

    This tab is the starting point of using the DataFerrett microdata.  This is where you will search for topics and items and select the survey you wish to use.  From this tab you also can explore items in more depth, select items and select specific values of items if you choose. You must select items before you will be able to do anything in the following two tabs.

    Step 2: Data Shopping Basket

    Once you have selected some items, this tab allows you to do several things. First, it is where you can see which items you have selected to use - these are the items in your Data Shopping Basket. You can recode a highlighted variable, modify your variable which will pop up the Ferret Browse Variable window, delete an unwanted/needed variable, or attach / adjust an advanced SQL statement option to any or all of your variables in your DataBasket. More detail is given in Chapter 8 on Recoding a Variable

    Step 3: Download/Make a Table

    After you have all the variables you want in your Data Shopping Basket, you go to this tab to get the data.  This tab allows you to either download the data to your own machine, or create your own custom tables and graphs. More detail is given in Chapter 10: Download Data and in Chapter 12: Make a Table.

    Please Note:  You must always begin at the Step 1: Select Dataset & Variables tab. However, once you have selected items and saved them to your DataBasket, you can go back and forth between all three tabs. For example, after creating a recode from your first selections, you can go back to the Select Dataset & Variables tab to search for more items to add to your Data Shopping Basket.

    Download / Extract Data

    Many data users use other software to work with data. DataFerrett has incorporated the capability to download into several formats as well as maintaining the DataFerrett FTP site. Chapter 10 discusses the Download capabilities of DataFerrett.

    Mapping Selected Data

    In DataFerrett certain geographically selected data can be displayed in a map. Chapter 14 has information on creating a map in DataFerrett.

    top

    Chapter 6 - Step 1: Select Dataset & Variables Tab

    The Step 1: Select Dataset & Variables tab under the Microdata tab is the screen used for selecting and searching through datasets for variables. This chapter will discuss and show you the many different functions available in this step of the DataFerrett data selection process.

    The "Select Dataset(s) to search:" pane of the tab will show which surveys are available by organization or dataset name. Click on the plus (+) sign next to the folders to open to the datasets for that title.

    The data is included in the branch that has the bullet. Double click on the bulleted title and all of the variables in that survey will appear on the screen to the right of the survey listings.  (note:  If the dataset has been published with the use of topics, then the Ferret Topics window pops up and you can select specific topics before it lists the variables to the right.)

    Search By Topic

    You can also find variables by opening the dataset with a 'double click' on the  bullet to view the topics supplied for that dataset.

    NOTE: to use data from a CPS Supplement survey with its matching Basic file, you must highlight the folders for BOTH the Supplement and corresponding Basic month.

    This type of data is Microdata. For more information on Microdata vs. Aggregate Data in Chapter 15.

    Search by Key Words

    You can find variables by limiting the search parameters by entering text in the boxes above Match ANY word. Use specific variable(s) names or the variable label for searches. Separate search words by spaces. (e.g., sex race age). Select the "Search ... by:" box(es) as needed. The default is the Search for Variables by: Labels.

    New Search Button

    Want to make a change? The New Search button allows you to clear your text from the search and to begin another text search.

    When you are satisfied with the parameters for the search,  click the Go button. 

    List of Returned Variables window

    The number of variables that are returned can be very large depending on your search parameters.

    The results of your search will appear in the columns next to the Select Survey(s) to search: pane will list the survey contents with its Topic (type of variable), the Name of the variable, its Availability (the period of time for when the variable is valid from) and the Variable Label (short description of the variable).

    With the return of large or small numbers of variables you can manage your view several different ways:

    • By clicking on the arrows indicated below, you can open up the list of Datasets pane or click the other arrow and see the full description of the variables.

    • Clicking in the gray header box you can drag "columns" into view or any order you choose.
    • Also, by clicking in the gray header box of each "column" you can alphabetize by that column;
    • The "Availability" column shows a year and a hyphen. If there is another year after the hyphen it means that the variable was available ONLY during those years. If there is no year after the hyphen then the variable is still being asked exactly the same way today.

    i Button

    The i button pop-up will give instructions for the Select Dataset & Variables Tab.

    Sorting may be accomplished by clicking on the column head. You may also drag the columns wider or narrower depending on the information you may need to view. The columns may also be rearranged in any order by holding down the left mouse key while clicking on the column head and dragging to the desired position.

    Highlight the variables you are interested in

    The Browse Variable Button

    Once you click on and highlight a variable(s) in the list of variables returned from a dataset search, you will see that the Browse / Select Variables button become active.

    To see which variables and values you have available to be selected, click the Browse / Select Variables button to view the variable's full description, universe, and response values specifically. You may also double click individually on a variable to browse its values and universe. This will pop up the Browse Variable window which will be discussed in Chapter 7.

    Empty Data Basket Button

    Want to begin again? Start Over? Click on Empty DataBasket if you want to begin the search process over from the beginning and this will empty your DataBasket and begin searching through another dataset.

    top

    Chapter 7 - Ferrett Browse/Select Variables & Values Window

    Selecting the Browse/Select Highlighted Variables button will open a new window that allows you to browse variable descriptions and values for all items that you selected in the Select Dataset & Variables tab. This is also called the Codebook. This window also allows you to select the variables that you want to be put into your Data Shopping Basket where you can recode, modify, or delete a variable. These functions will adjust your values/variables for extraction or tabulation.

    Codebook

    In this version of DataFerrett, your DataBasket will only hold variables from a single dataset. This will be discussed further in Chapter 8 .

    Ferrett Browse Variable Window Instructions:

    1. The variables you have chosen in the Step 1: Select Dataset & Variables tab are listed at the top of the Ferrett Browse Variables window. Highlight them one at a time to read the variable description (e.g. question text) and possible answer categories (values) in the pane below.

      Browse/Select Variables & Values window

    2. The scroll box next to the Ferrett graphic shows the highlighted variable's full description. This may include the full question text, the universe and any related variables. The information contained here is entirely dependent on what metadata was defined by the subject matter experts.

    3. The next box shows the possible answer categories for the highlighted variable. The above example shows the values for a continuous range variable.

    Add to DataBasket

    If you want to add the variable to your DataBasket to use in the Download/Make a Table step, select it by clicking the Select Variable Name box. If you want to keep all the variables you selected, check the Select ALL Variables checkbox

    1. Once you have highlighted a variable, you then have the option to subselect specific values of the variable to limit your universe. When selecting a continuous range variable, the screen then looks like:

      Browse Variable with continuous range

    2. Notice that the range of possible values has the minimum and maximum values in editable boxes.

    3. Simply highlight the min or max and type in the value you wish to change it to.

    4. The selection of values for a categorical variable is slightly different. If we select the PEMARITL variable in the top box of the window, we see the following:

      Browse Variables window - categorical variable

      Notice the possible values are listed with their value code and label.  If we click the Select CPS PEMARITL ... box to select the variable, we then get the following:

      Browse Variable window multiple selections

      Now notice that there are check boxes next to each possible value.  The default is that all values will be checked. If you don't want all the values for a variable, click the box next to the values that you don't want. Some variables will have hundreds of values (e.g., FIPS codes, ICD9 codes, etc). The button next to the select values box (Deselect all Values) will do just that, deselect all values. You can then go back and select just the ones that you want included into your DataBasket.

      Select ALL Variables

      Checking this box selects all of the variables that are listed in the top box of the window to be placed into your DataBasket. This is the quickest way to select multiple items when you know you want all variables and values for all the items you chose from Step 1. Also, you can check this box, then highlight a specific variable in the top box and then subselect values for that item.

      If you have returned a variable with numerous possible values and only wish to select one or a few, you can deselect the values with the "Deselect All Values" button on the right. Then you can select the one or ones you need.

      After you have selected all the variables that you want to put into your DataBasket,  click the OK button. You will receive a pop-up window that informs you how many variables you selected for your DataBasket.

    Once you have added variables to your DataBasket you will see in the Step 1: Select Datasets and Variables window that dataset titles may be grayed out. This means that the variables you have chosen are not comparable to other datasets and are therefore unselectable.

    See the next Chapter for information about the Data Shopping Basket .

    Now would be a good time to save your DataFerrett session. This can be done by selecting the disk icon on the toolbar or select File > Save As and place the file in an appropriate directory. The default will direct you to save in a folder called TheDataWeb. For more information on TheDataWeb see the website link What Is TheDataWeb.



    top

    Chapter 8 - Step 2: The Data Shopping Basket Tab

    In this version of DataFerrett, the Data Shopping Basket will only hold variables from a single survey.

    You can:

    • See all the variables you have selected.
    • Go back to the "Step 1: Select Dataset & Variables" tab to add more variables.
    • Go on to "Step 3: Download/Make a Table" tab to extract data or tabulate it.

    The other options in Step 2 can be executed by highlighting and selecting a variable from your list of Current Query Variables. This will cause the following options to become available: Recoding, Deleting, Modifying the variables as you want.

    Recode Variable(s)

    If you want to create new value groupings for your variables.
    • Highlight the variable you want to recode and click on the Recode Variable(s) button. Only one variable may be selected at a time. Recode will open the Recode Variables window. This will be discussed in length in Chapter 9.
    • Values can be collapsed into groups and renamed.
    • Recode Variables can also break down into groups of continuous numbers of a variable's values.
    • At this time, you cannot recode from another recode you created.

    Delete Variable(s)

    This function allows you to remove variable(s) from the Data Shopping Basket. Highlight the variable(s) you want to delete and click Delete Variable(s) button.

    Modify Variable(s)

    Modify will open the Ferrett Browse Variables window again and you can change values that were previously selected.

    Advanced SQL Option

    This function allows you to show the SQL routine that will be used to run your query. Advanced SQL opens the Advanced sql options to change clauses and add conditions in a standard SQL syntax.

    Save Selected Variable(s) Codebook

    This allows you to save the documentation for the variables in your Data Shopping Basket to an ascii text or html file. The documentation includes the variable name, label and value descriptions.

    top

    Chapter 9 - Recode Window

    What is a Recode?

    A recode is basically a new variable that you create by combining or collapsing the answer categories of an existing variable. For example, making an age variable that has values ranging from 1 to 100 into a variable that has 5 age groups of 20 years each (1-20, 21-40, 41-60, 61-80, 81-100). DataFerrett allows you to define the groups any way that you like.

    As another example, say you are interested in some differences between married and unmarried persons, and the dataset that you are using has a variable defining marital status as six possible categories like this:

    1. = Married, spouse present

    2. = Married, spouse absent

    3. = Widowed

    4. = Divorced

    5. = Separated

    6. = Never Married

    You can create a new variable that regroups these into just 2 categories, married, and not married, by combining values 1 and 2 into the married category and the rest into the not married category.

    Recoding a Variable - Getting Started

    In order to create a recode you must have the variable that you want to use as the basis of your new variable in your Data Shopping Basket

    • In the Data Shopping Basket Tab, highlight the variable that you want to use as the basis of your new variable. You can only select one variable at a time.
    • Click on the Recode Variable(s) button. This opens the Recode Window.

    Recoding a Continuous Range Variable

    A continuous range variable is one whose possible values are defined only as a range, each value is not labeled or defined. For example, age is often a continuous range variable because the possible values are for all ages. Variables reporting income are also usually continuous range variables. This is what the Recode Window looks like for a continuous range variable:

    First thing to do is to highlight the Label RECODE 1 and type over it changing the label to something relevant.

    Next to begin the recode process, highlight the " 90 " and change it to an age. I picked 17. Then hit the "Recode" button.

    Notice how the list of ages has changed on the right under Label.

    Now I've decided to group the rest of the ages to breakdown every 20 years. Type in 20 into the "Subgroups Repeat by:". Now Hit Recode. Notice how the Not Elsewhere Classified under "Label" has changed to include all the individual values into 20 year increments or values.

    Highlight the label name and change to something relevant.

    My results are as follows:

    This is how the Step 2: Data Shopping Basket tab window looks now. The original variable is still in the Data Shopping Basket as well as our new variable called My Age Recode.

    Recoding a Categorical Variable

    This allows you to regroup a variable's values from many categories to fewer. A categorical variable is a variable that has distinct categories as possible values, For example, the marital status variable we mentioned at the beginning of this chapter which has the values:

    1. = Married, spouse present

    2. = Married, spouse absent

    3. = Widowed

    4. = Divorced

    5. = Separated

    6. = Never Married

    The recode window for a categorical variable looks a little different from the window for a continuous range variable.

    1. Type over the text "Recode 2" in the upper left text box to label the variable something intuitive. Married / Not Married will be this recode Name.
    2. Highlight the values 1 and 2. Click Recode to break these two values to Set to Value 1.

    3. Highlight and type over the recode value 1 to say "Married"

    4. Highlight the categories of your variable on the left that you want to regroup into a single category. Highlight multiple categories by clicking and dragging over the categories or by using the Ctrl-click combination.

    5. Click the "Recode" button.

    6. The new values should appear in the list on the right. The left most column is the new value and the "Values" column on the right shows you which values from the original variable are contained in the new value.

    7. Type over the new value labels on the right so that the labels are more relevant.

    8. Continue as necessary.

    9. Click the "Ok" button to save the new variable in your Data Basket. Now when you view your Data Basket you will see the variable you recoded with the name RecodeX ('X' being a sequential number- 1,2,3 ...) and the label that you gave it.

    10. At this point you will want to Save your DataFerrett Session. This will save your DataBasket. Either select the "disk" icon at the top of the screen or select File > Save As > and type in a file name. This is advantageous to do for many reasons; you may need to stop working in DataFerrett or you can resume at a later time and update to newly released data.

    top

    Chapter 10 - Step 3: Download/Make a Table

    On the Step 3: Download / Make a Table tab there are two choices:

    1. Extract or download the data to your machine; or

    2. Tabulate the data in a spreadsheet.


    This chapter will focus on Downloading [Extract] data. Chapter 12 will focus on Tabulation.


    If you have selected a dataset that requires a geography selection, a warning will pop up when you select either the [Extract] or [Tabulate] buttons:

    Chapter 11 will go into the steps for selecting geography.

    Extract Button

    Extracting data means downloading the data for the specific variables that you have selected and saved in your Data Shopping Basket.

    Clicking on the Extract button brings up the following options:

    Download Data

    Check the box to Download Data to your machine.

    Choose the file format you want.

    • SAS (version 8.0 or higher) format will come with SAS input statements

    • SPSS format

    • EXCEL / ACCESS database format

    • Space delimited - If you are going to read it into a statistical package.

    • Tab delimited - if you are going to read it into a spreadsheet package.

    • Comma Delimited

    Click the [GET EXTRACT] button to get your data downloaded.

    Currently there are two compression methods available: zip and gzip.

    Display Data on the Screen

    This allows you to see the first 50, 100, or 200 rows of data in your extract. A new browser window appears with a link to the display. This is an ASCII type display with each variable in a column and the values for each row of data. For example:

    Running your Request in Batch Mode

    This allows you to run the job in batch and get the results later instead of waiting for the results. You will be sent an email message with a link to your data file when the job is complete. You might want to use this option if you are trying to extract a very large number of variables. If you choose this option a window pops up telling you the url to check later for your results. This is where the correct input of your login email address is important. Please do not insert false or incorrect email addresses to login to DataFerrett.

    Batch job location pop up window

    Working with SIPP Extractions using SAS

    The instructions included on the DataFerrett/SIPP CD give guidance to the user about how to save a DataFerrett extract as a SAS file. The recommended approach is to save the extract as a SAS file, close DataFerrett, launch SAS, and then import the saved extract into SAS for execution. This approach will minimize the likelihood that the user's computer might exhaust its memory resources.

    Some users may wish to provide a more automatic invocation of SAS by associating the file suffix of the DataFerrett extract with a particular version of SAS existing on their computers. It is important to note that SIPP extractions from DataFerrett should be processed in SAS version 8.x or higher. Earlier versions of SAS will not properly handle certain features of SIPP data, such as the length of labels. If you wish to create such an association to enable automatic invocation of SAS, please check with your LAN Support specialist regarding how to do so in the most appropriate way in your local environment.

    Tabulate Button

    This button opens the Make a Table window which allows you to create your own tables, maps and graphs using the variables you selected and placed in your Data Shopping Basket. The table functions are explained fully in the Chapter 12.

    top

    Chapter 11 - Using the Geography Wizard

    Selecting Geography is available in some of the DataSets in TheDataWeb for DataFerrett. You will need to select a geography for the Decennial DataSets and this chapter will be devoted to examples and steps for the Summary File 3 DataSet.

    If you double click to select the Decennial Summary File 3 - 2000 - SF3 Geography this Topics window pops up.

    For this example the following shows that the topic selected was "Selectable Geographies" and has returned the following variable [Geography Items] which has been highlighted.

    Double click on the highlighted variable.

    As you can see below there are several options that pop up in the Browse/Select Geographies GeoWizard.

    The FIPS State Code can be selected and the list of states will be available. For this example we want to go a little lower down in the hierarchy and select the counties in a state. Select Geography Unit: --FIPS County Code then click Next > .

    The GeoWizard now wants to know what FIPS State you are going to want the counties from. Select the FIPS State Code and click Next > .

    To select all states highlight the first one then scroll to the bottom; hold the shift key down and select the last state. You can select a random sampling of states by holding down the control key. For this example we are going to select the state of Arkansas and click Next > .

    Now select Arkansas in the STATE Selected pane on the right. Click Next > .

    This is now going to bring up all the counties in the State of Arkansas. To select the counties, again, select the first one (Arkansas County) and hold down the shift key as you scroll to the bottom of the list and select the last county, which is Clay County.

    Since we are done with this click on the Next > .

    Now all the counties are listed to the right as the geography for this query. Now, select [FINISH] to complete this GeoWizard Session.

    You are able to return to the Geo Wizard at any time to make changes to your geography selections. Step 2: Select the Geographic Characteristics variable and the click Modify (on the right menu bar) to modify your variable and bring up the GeoWizard again.

    In chapter we will go through the Chapter 14: Map Feature for the DataFerrett using our Geographic selection.

    top

    Chapter 12 - Make a Table

    Setting up the Table

    Once you have your variables in your DataBasket, left click on a variable and drag it over to the desired area on the spreadsheet as shown on the animation below. Repeat the process for the second variable. If you wish to change the variable names on the table simply click on the appropriate cell and edit the text. You may always go back and add more variables to your DataBasket. Populate your table by dropping the variables into Row 2 or Column 2.

    Once you have a question mark in the columns and rows of your table, click on the [GO] button to get numbers or select File > Get Data.

    The box under the list of variables gives the universe, weight, datasets and source of your variables.

    Creating New Columns Using Formulas

    You can add, subtract, multiply, divide, sum, square root across or down, rank, If, three conditions Greater Than, Less Than and Equal To, compare columns to columns, rows to rows. Highlight a blank column or row by clicking at the R# or C# level.

    In the formulas: Refer to rows as R1,R2..etc.
    Refer to columns as C1,C2..etc.

    There are three types of formulas:

    Computational

    Computational allows add, subtract, multiply, divide and square root. Simple conditions are allowed for columns or rows.

    Here are some examples:

    =comp(R2+(R2*R3))
    =comp((C2/C3)*100.0)
    =sum(C2,C3,C4,C5)

    Ranking

    Ranking allows nominal value significance of largest to lowest.

    =rank(C#)

    Simple Conditions =if(C2 > 0,C#,C#)
    =if(C2 < 0 ) then C# else C#

    DataFerrett computations are also capable of adding missing data, and is shown as [miss1]. It can adding missing data up to 5 instances: (miss1 - miss5) and they have specific values and specific meanings.

    Getting the Numbers

    Once you have your rows and columns set up, the cells in the spreadsheet will be filled with question marks. Press the Go button on the toolbar to get the numbers or from the drop down menu, select File > Get Data. For special formatting with decimal points, Edit > Format > will show options for formatting your numbers.

    Adding a New Dimension to a Table

    To add another dimension to a table simply go back to the Select Dataset & Variables tab and select another variable (such as sex). Add that variable to your Data Shopping Basket. When preparing your table, drag the new variable to the column or row and drop right on top of the existing column or row (which ever area you want another dimension). This is called nesting. The second or even third drop of a variable onto the existing variable will nest below the existing variable.

    You are also able to drop a completely separate variable(s) above or below a list of variables in the columns or next to a column. Columns and rows can have blanks between to show separation if desired.

    At this time DataFerrett does NOT have an undo function. If a variable is placed inadvertently, or a decision is made to change the layout, the table will have to be re-populated.

    Select the "Clear Table" button.

    Once all the cells in the spreadsheet associated with your variables are filled with question marks, press the Go button.

    If you can not see all the columns select the Spreadsheet Only button. This is an on / off toggle.

    Select the [GO] button to get your numbers.

    Saving Table/Spreadsheet for Later and Repeated Usage

    Tables may be saved for later usage. Select File > Save As>: to save the file. The variables from the Data Shopping Basket and the layout of spreadsheet will be saved as an .ftf file (Ferrett Table File). The numbers will not be shown when you reopen the table, but the [GO] button will be active. The Table layout may also be saved and opened by clicking on the drop down menu under [File].

    Copying from the DataFerrett Spreadsheet into other Spreadsheet Packages

    DataFerrett Desktop Application does not require a policy file. DataFerrett Applet users must install the DataFerrett Policy File in order to copy and paste data or create a user policy file.

    NOTE: DataFerrett Policy Install File

    DataFerrett users that have proxy server for their firewall will have to have this installed in WINNT. This will tell Windows to allow for copy, paste and save functions from DataFerrett. The policy file, which is a very small text file can be found at http://www.thedataweb.org/policy.html . Download and save the policy file installation program to your desktop. Once it is done downloading double click on the to run the install wizard.

    Graph Results

    Highlight the desired rows and columns you wish to graph (not including the variable names) and press the graph button. Chapter 13: Make a Graph will go into more details.

    Tabulation Menus and Buttons

    Drop Down Menus:

    FILE > New will create a new spreadsheet window.

    OPEN > Open will open a previously saved .ftf file. The .ftf file is the spreadsheet layout without the numbers returned. It will have saved the variables from the DataBasket and the variables as you placed them in columns and rows.

    OPEN IN NEW WINDOW > This works similarly to a Open New Window.

    SAVE > will save your spreadsheet as an .ftf file that retains the layout and DataBasket of variables. This can be called up later.

    Debug is for DataFerrett Developers and is not a function for users.

    The print function for DataFerrett is set up with a default of landscape orientation and multiple pages. This will print out in normal size. If it is desired that the you would like your table all on one page, the print function will "squish" everything into a single page.

    Highlighted rows or columns that are deemed unnecessary or not needed can be "hidden." This will be reflected by a jump in the column or row number. Highlight the column or row by clicking on the C# or the R#.

    Unhidden will return the column or row to view.

    Delete will delete whatever is highlighted.

    Insert will insert a row or column at the cursor point or next to the highlighted column/row.

    Format will allow the formatting of numbers of decimal places and show numbers in tens, hundreds, or thousands.

    DataFerrett's formula bar will do means calculations and the standard error formula weight of the counts, weighted count of what you are tabulating. As with all formulas in DataFerrett you have to highlight a column or row by clicking the C# or R# and type in the formulas with the syntax as shown above.


    These little triangles will open or close the spreadsheet and works just like the [Spreadsheet Only] button on the toolbar (as shown below). This will "hide" the variables and give a full view of the columns. Click on the direction of the arrows to open to the right to cover the variables, or to close, choose the arrow pointing to the left to reveal the variables and universe.

    Menu Bar

    This section is geared to discussing some of the other functions of the menu bar for the Tabulation spreadsheet.

    This button will clear the entire spreadsheet of all the data. At this time, DataFerrett does not have an UNDO function, so careful construction of a spreadsheet is required. But, this clear button will allow the opportunity to change layouts. The DataBasket will remain and not be cleared, just the spreadsheet graph area.

    Select File > Save As to save the table layout and Data Shopping Basket.

    top









































    Chapter 13 - Make a Graph

    Change the Type of the Graph

    The type of the graph used can be changed by clicking on the graph drop down menu and choosing the desired type of the graph.

    Adding a Title

    By clicking on the Title button on the graph screen, you will then be prompted to enter a title for the graph.

    Adding a new Variable into the Graph

    To add a new variable go back to the Select Database & Variables tab and find a new variable. Add that variable along with your pre-existing variables to your databasket. Then regraph the old data long with the new variable.

    Saving a Graph

    When saving a graph you can either choose to save it as a .graph file or you can export it as a .jpeg file. To save the file with a .graph extension click on the disk icon and specify the area to where you wish to save to. To export the graph as a .jpeg file, simply click on the export button and specify the area to which to save to.


    Printing a Graph

    Click on the print icon to print the graph to a connected printer.

    top

    Chapter 14 - Make a Map

    The DataFerrett Map is a new feature developed for the Make a Table / spreadsheet screen in the 1.1.7 version of DataFerrett. This feature is activated for datasets supplied with geographic data which was selected using the Geography wizard. Begin by highlighting cells of your table that have geographic data.

    Select the map button on the toolbar.

    For this example we are using Arkansas data, Males in the Labor Force in all counties of Arkansas. The following shows the map of Arkansas with Counties. The darker the color of the county the larger number of Males in the Labor Force.

    You can select the [View] drop down menu and select [Legends] which shows the color differences.

    The default number of differences is 7.

    By changing the legend to 9 it makes more dramatic color differences. A lesser number will show lesser differences.

    Under [Layers] different Layers of geographic features include highways, water, and roads can be added to your map.

    top

    Chapter 15 - Microdata vs. Aggregate Data

    Microdata vs. Aggregate Data

    Microdata is data in which every record is at the unit of analysis level and all records must be added up to get the totals for each data item. For example, for surveys of individuals, microdata contain records for each individual interviewed; for surveys of organizations, the microdata contain records for each organization.

    Aggregate data is data which has already been summarized or added up, usually for specific geographical units or some other unit, such as industry classifications. In this case, each record is a geographical unit and there is no summing needed to get the totals for the geographies.

    top

    Chapter 16 - Application vs. Applet

    For users that have a proxy server, the DataFerrett Applet should be installed and run with your internet browser. The proxy server does not all installation of software to a users computer.

    To determine if you have a proxy server:

    • Microsoft Internet Explorer: TOOLS > INTERNET OPTIONS > CONNECTIONS > LAN SETTINGS If you have a checked box in the "PROXY SERVER" section you have a proxy server.

    • Netscape: EDIT > PREFERENCES > ADVANCED > PROXIES If you have a selection in Manual or Automatic proxy configuration, you have a proxy server.

    It will require the services your System Administrator with special privileges installing new software, usually. The DataFerrett application is actually created using the same code as the Applet, it is just that as an applet user with a proxy server you can/will need to set up DataFerrett and take a couple of extra steps.

    DataFerrett is created using JAVA. Since we are all generally using a Windows environment, you will have to give Windows permission to allow the java to perform functions for you. This sounds a little intimidating, but it isn't really.

    1. First go to the following link and download the DataFerrett Applet installation file to your computer DESKTOP. This file is only 5.2MB.
    2. Double click on the icon that is created on the DESKTOP to run through the installation wizard.
    3. Once this is complete you will want to install the "permissions" for Windows. Again, go to DataFerrett Applet Install page and click on the link " Copy, Cut, Paste & Save in DataFerrett". This will put another icon on your desktop.
    4. Double click on the icon (j2re1_3_1_01-win.exe) The DataFerrett Applet will place the following file in your WINNT folder in a JavaSoft folder. OR, you can create you own

      Creating your own Permissions File

    To create your own permission file simply open up a text editor (Notepad, Wordpad, etc.) and paste the following code into it:

    /* DataFerrett Java Policy to allow copy/paste, print, and save */
    /* DO NOT EDIT */
    /* Version 1.01 */

    grant codeBase "http://dataferrett.census.gov/-" {
          permission java.io.FilePermission "<<ALL FILES>>", "read, write, delete, execute";
          permission java.lang.RuntimePermission "queuePrintJob";
          permission java.util.PropertyPermission "user.home", "read";
          permission java.awt.AWTPermission "accessClipboard";
    };

    grant codeBase "http://ferret.bls.census.gov/-" {
          permission java.io.FilePermission "<<ALL FILES>>", "read, write, delete, execute";
          permission java.lang.RuntimePermission "queuePrintJob";
          permission java.util.PropertyPermission "user.home", "read";
          permission java.awt.AWTPermission "accessClipboard";
    };

    Save the file as ".java.policy" and place it into the following Windows directory depending on your operating system:
    C:\Winnt\Profiles\uName on multi-user Windows NT systems
    C:\Windows\Profiles\uName on multi-user Windows 95 systems
    C:\Windows on single-user Windows 95 systems

    Now you are ready to start DataFerrett. Go to the DataFerrett website page with the Netscape and Internet Explorer icons. Choose the browser you use and DataFerrett will begin.

    top

    Working with SIPP Extractions using SAS

    The instructions included on the DataFerrett/SIPP CD give guidance to the user about how to save a DataFerrett extract as a SAS file. The recommended approach is to save the extract as a SAS file, close DataFerrett, launch SAS, and then import the saved extract into SAS for execution. This approach will minimize the likelihood that the user's computer might exhaust its memory resources.

    Some users may wish to provide a more automatic invocation of SAS by associating the file suffix of the DataFerrett extract with a particular version of SAS existing on their computers. It is important to note that SIPP extractions from DataFerrett should be processed in SAS version 8.x or higher. Earlier versions of SAS will not properly handle certain features of SIPP data, such as the length of labels. If you wish to create such an association to enable automatic invocation of SAS, please check with your LAN Support specialist regarding how to do so in the most appropriate way in your local environment.



     |  Microdata Tutorial  |  Longitudinal Tutorial  |  Aggregate Data Tutorial  |  DataSet Topics  |  Users' Guide  | 
     |  What is DataFerrett  |  Install DataFerrett  |  GoTo DataFerrett  | 

    Last update: 05/19/2004