We are starting with my in-progress Google Sheet and will skip several steps of the Workflow in the interest of time. I will briefly demonstrate the corpus analysis toolkit AntConc for later exploration; only Google Sheets is needed for this workshop.

For this workshop, please make a copy of the Google Sheet for your work.

While not needed for the workshop here is the link to a Google Drive Folder of txt versions of the cookbooks

This workflow is inspired by The Data-Sitters Club

Students are welcome to use and modify the spreadsheet for non-commercial research purposes with attribution under this Creative Commons license

Workflow

  1. Decide on initial cookbooks 

    1. Consider dates of cookbooks, look into relevant copyright considerations

    2. I chose texts from Project Guttenburg that had been proofread after the texts had been through Optical Character Recognition (OCR).

    3. Consider where you want to start your research, knowing it will likely change during the process.

    4. Record what you expect to explore in the cookbooks

    5. Use The Sifter to assist with cookbook decisions

  2. Open the Plain Text version of each cookbook and copy and paste into a text editor, changing font color as needed, one at a time.

  3. Create a Google Sheet, including a header row, freeze header row

  4. Decide on Initial Column Headings, for this workshop I have set Column Headings based on my own research interests.

  5. Create a food plan for yourself

  6. Delete table of contents, title page, indices, and prefaces in the txt document

    1. Record what gets deleted in a data handling document

  7. Save as a .txt document in a dedicated, findable folder

  8. Open each document, one at a time into AntConc. Explore file in AntConc 

    1. Note which cookbooks have more words and more unique terms

  9. Navigate to the “Word List” tab and hit the “Start” button

  10. Make sure the header of AntConc is set to 5000 and copy the “Word” and “Freq” columns to Google Sheet. Copy the “Word” column both to the “Original Term” and “Standardized Word “in the Google sheets

  11. If needed, hit the green arrow button in AntConc and continue to copy and paste additional terms and frequency

  12. Complete the Cookbook Title, Author, and Cookbook Year Columns in Spreadsheet if using these Headings

  13. Add “Term Type” Column to Google Sheet, and set it to “Stop Word.”

  14. Add more rows to Sheet as needed

  15. Start thinking about Term Types

  16. Once Term Types start to feel finalized consider creating dropdown menus

  17. Notice any biases that are showing up in your Term Types

  18. Set a column for “Further Research” and set to “No”

  19. Start categorizing terms

  20. Create standardized terms when stuck; for example, okra and ochre are both the standardized term okra

  21. When stuck, exercise or find another way to clear your head

  22. Create a Notes field to record frustrations

  23. Create extra columns based on research interest

  24. Start a Data Dictionary on a separate sheet

  25. Return to cookbooks in AntConc and examine terms that need more research

  26. Take a deep breath

  27. Brainstorm about where to start on Pivot Tables

  28. Record Your experiences throughout, especially what is causing you to become frustrated. These frustrations are generally a great starting place for further research. Use The Sifter and return to the cookbook texts in AntConc as needed.

  29. Look at secondary sources to answer questions and explore frustrations

  30. Start creating Pivot Tables

    1. Google Sheet power users disregard this section and create Pivot Tables based on your own workflows

    2. Note: I generally use Tableau Public for my research visualizations

  31. Go to Insert Tab and scroll down to Pivot Table

  32. Create a Pivot Table on a new Sheet

    1. Set “Rows” as “Standardized Term”

    2. Set “Columns” as “Cookbook”

    3. Set “Frequency” to “Frequency”

    4. Set “Filters” as “Fats” and use the dropdown menu to deselect “Blanks”

  33. Select stat in the Pivot Table, not including the Grand Total

  34. Click the chart button on the right side of the sheet ribbon

  35. Consider what could be improved in the Chart

    1. Edit the Pivot Table by clicking on the Edit button in the bottom right corner below the Pivot Table 

    2. Go to Chart Editor by clicking on the three buttons in the top left corner of the chart

  36. Return to Google Sheet to Fix typos found (not for this workshop)

  37. Explore the Chart Editors and Pivot Table Editor

  38. Create your own charts

  39. Frustration is a useful part of the process

  40. Download any useful charts

  41. Consider what further examination of the spreadsheet is needed to explore questions raised by the data

  42. Consider what additional research can assist in exploring questions

    1. For my research, I needed to check where pineapple was printed in the cookbooks as “pineapple” vs. “pine apple”