I am using datasets provided by Prof. Kane that list economic transactions between Dutch fur traders and their indigenous trading partners that conducted business in Albany, NY. Most of these exchanges involve furs of various mammals, including bears, martens, muskrats, and most notably, beaver. These datasets represent primary documents written by Jelles Fonda, Evert Wendell, and an unknown Dutch trader who operated in the Hudson Valley. These account books record more than just fur transactions, as all three traders also included comments on the appearance of their partners, their kin, where they live, how much debt they owe, and other trade goods they exchanged. There are some non-native traders included in these accounts, but not many.
I cleaned the Wendell dataset by fixing typos and combining records such as “?” and “???”. I matched up inconsistent abbreviations for the dates, like “Sep” “Sept” and “September”, all of which seemed to appear equally in this dataset. Annoying. For all four datasets, I spent a long time combining records in the “Trade Goods”, “Items” and other similar columns. I decided to just put “1 bottle of rum”, “3 casks of rum” “2 rum” all under 1 record: Rum. I decided to do this because I will be able to gain more information from a broader column like that, where all the records that just pertain to a certain amount of rum being traded can all be analyzed together as opposed to under smaller categories like “1-3 casks of rum” or “5 or more casks of rum” or something like that. I’m not interested in the quantity here, I’m interested more in the content. Same with “big knife”. I edited that to “knife” only.
For the Tribes column, I eliminated question marks. I decided to combine “Mahican” and “Mahican?” because I figured something labeled “Mahican?” is already, in my mind, associated with “Mahican” so the data is already skewed in that way. The question mark implies it is uncertain or not confirmed, but we can not be completely sure about all this data anyway since no dataset is perfect or completely accurate. The traders writing these could’ve made their own errors. This is also why I edited the year records as well, such as “1698?” to “1698”. I changed a lot in the Town column as well, matching “of the Catskill area” with “Catskill” because both imply origin. I made “Canadian” and “lives in Canada” the same thing as well for simplicity’s sake. “Mohawk who lives in Seneca country” and “Seneca lives in Cayuga” became “Mohawk who lives among Seneca” and “Seneca who lives among Cayuga”. I wanted to keep the wording as close as possible to create the cleanest of datasets. Lastly, I moved some things that were in the wrong rows. For example, I moved “escorted by the Greyhead’s wife” from tribe to affiliation where it matched other records.
I changed this data by eliminating the pages field for all datasets. There probably is something interesting there if I decided to look at whether it was in chronological order and if not, if certain trade transactions were included earlier than others and if that could reflect some sort of bias on the part of the trader, but there’s no way to know for sure if the records were organized on pages completely randomly.
Another big thing I changed was the Date column. There were some records that only included the year, some with just the year and month, and some with the year, month, and day. First, I separated everything in the column by semicolons so it looked like “Sept;18;1709” and then I split the columns based on the semicolons. So I ended up with three extra data columns. Next I eliminated the column containing the days of the month because those aren’t helpful to me and only a small portion of the records actually contained this information. Next I organized the months and years into separate columns. That took some time because they didn’t split very evenly. But OpenRefine has an “edit all matching cells” option so that when you edit the year for one row, you can apply that same edit to all matching rows. Once I figured that out, it went a lot faster.
<script type='text/javascript'> var divElement = document.getElementById('viz1618017989072'); var vizElement = divElement.getElementsByTagName('object')[0]; vizElement.style.width='100%';vizElement.style.height=(divElement.offsetWidth*0.75)+'px'; var scriptElement = document.createElement('script'); scriptElement.src = 'https://public.tableau.com/javascripts/api/viz_v1.js'; vizElement.parentNode.insertBefore(scriptElement, vizElement); </script>
Here are two sample visualizations I came up with. The first one is from the Wendell dataset. It shows the number of beaver furs traded each month and filtered using a year slider. Tribe is visualized by color. 1710 yields the most data here, almost creating a bell curve with July having the record number of beaver furs Wendell received that year (64 furs). This year also appears to represent the most native groups, but there are other years that include this diversity as well (1704, 1706, and 1707, to name a few).
The second visualization is from the Jelles Fonda account book held at the New York Historical Society. Here I visualized the number of beaver furs traded by the top eighteen traders that Fonda worked with. Interestingly, the top two are non-native traders, as you can see by the color differentiation. I’d like to work a bit more with this dataset and pick out more variables to visualize, such as changes in the top traders over time. Unfortunately, this dataset does not have a gender column, but I can always try to make one based on the Name column that indicates “wife of…”.
<script type='text/javascript'> var divElement = document.getElementById('viz1618018745349'); var vizElement = divElement.getElementsByTagName('object')[0]; vizElement.style.width='100%';vizElement.style.height=(divElement.offsetWidth*0.75)+'px'; var scriptElement = document.createElement('script'); scriptElement.src = 'https://public.tableau.com/javascripts/api/viz_v1.js'; vizElement.parentNode.insertBefore(scriptElement, vizElement); </script>
2 replies on “9: Work”
I mention this not to embarrass you, but the version of those spreadsheets you’re working with are the versions I transcribed from the print or archival sources for my dissertation research before OpenRefine was a thing (and I agree, the typos/different spellings are annoying, ha 🙂 )–but just an fyi that much of what looks like annoying variance in Sep/Sept/September is because that’s what’s in the original. It is necessary to clean those, but I just wanted you to be aware of where they’re coming from.
I mention this also for your context, that some of what you’re working with like Mahican? is my conclusion, rather than Wendell or the anonymous Dutch trader’s. So you’ve got another layer of mushy historical distance to keep in mind. This accounts for some of the randomness of the eg town column–the original manuscripts and translation of the Wendell account book aren’t ordered in any straightforward way, so part of my transcription process was trying to figure out what columns were even necessary. I haven’t got images of the Wendell account book handy, but the Fonda_Jelles Old Fort Johnson.pdf in that dropbox folder I shared with you earlier this semester is a good example of how unordered these are. Which is a long way of saying–good for you moving around information between columns as necessary; it’s part of the process of clarifying your research question. I just want you to be aware of the process if you finding yourself doing this with other account books (and your own transcription) in the future, and how your research process has been shaped by the questions from my work. (At the time, I was interested in inter-Haudenosaunee movement between nations and towns of residence, for example).
Wendell is available in published translation (https://www.worldcat.org/title/to-do-justice-to-him-myself-evert-wendells-account-book-of-the-fur-trade-with-indians-in-albany-new-york-1695-1726/oclc/758872535&referer=brief_results)–we don’t have it in the library and I don’t expect you to track it down for this class, but it’s something to keep in mind if you use this in the future for dissertation work. The page numbers in Wendell are the archival pages they appear on in the original manuscript held by the NYHS, which are included in the published translation. There does seem to be some order–later visits are on later pages–but it doesn’t seem to have any great meaning; just something to keep in mind for citations of individual interactions and for later reference if you find yourself needing to track down specifics.
Similarly, the Wendell account book was translated by Kees-Jan Waterman; the Fonda and anonymous Dutch account books were translated from Dutch by me, so while the English is accurate, don’t get too hung up on wording.
Something to think about for the seasonal bell curve is splitting it by gender. I only did this with the Wendell data for my book, but there is a distinct difference in women’s and men’s seasonal movement. Breaking your nations out into separate visuals (eg, a dashboard of small multiple bar charts) may make it easier to see national differences.
There’s a lot of wholesale trade in the Albany area (eg, Fonda bought and sold to/from white traders who also bought/sold from Indigenous traders) so my assumption is that John Thomas and David Van Debarrack are some of those, but I haven’t pursued that. The Robert Sanders accountbook in the dropbox folder is an example of a wholesaler who dealt only with other white traders; William Johnson also did some of this early in his career.
For my troubleshooting purposes, did you use an HTML block when you went to embed your tableau visualizations? I’m trying to get the issue fixed where it embeds an image and shows some code, rather than embedding the interactive chart like it’s supposed to. I thought I’d had it fixed, but it looks like the issue is still there (through no fault of yours, I’m just trying to track down the issue).
I figured those typos/different abbreviations were in the original documents! I just went back into the dropbox folder to look through those Fonda images, thanks for those. I do plan on splitting my gender more. As for the white traders, I didn’t really think of looking into that at first but now I’m going to.
I did use an HTML block to embed my visualizations here! I thought it would be interactive too. But I saw you posted a how-to on it, so I’ll try that for this week.