Ben Gordon ’19 and Charlotte Nunes, Director of DSS, Reflect on Data Science at Lafayette

Hi everyone,

I wanted to make sure that you all got a chance to read this terrific blog post by Skillman Library’s Digital Scholarship Services about our very own 2018 Summer Scholar, Ben Gordon, and his experience as a Data Science Major here at Lafayette, as well as his DHSS digital research project,  New York City’s Subways, Bridges, Highways, and Expressways in the 20th Century.

Angela

I’m reposting the piece here:

 

January 30, 2019
Ben Gordon ’19 and Charlotte Nunes, Director of DSS, Reflect on Data Science at Lafayette

 

My journey to Digital Scholarship Services and Data Science has been a long and rewarding one. I came into Lafayette with an interest in Math because of a statistics course I took in high school. I was always successful in my math classes, but often didn’t understand the relevance of geometry, algebra, and trigonometry. Yes, problem solving was interesting, but for what purpose? This changed when I immersed myself in that statistics   course. I truly fell in love with the applied nature of the discipline, using mathematical equations in order to find meaning in the real world. Assignments weren’t just problem solving, but included some form of essay writing to explain what our mathematical answer meant.By the time I was a freshman in college, this course had stuck with me and I was itching to continue. I was hoping for an applied version of mathematics in college, deciding I would be a major in math. I did not know what engineering might have entailed, although that might have ended up more of what I was looking for, so I ended up in math. However, when I took transition to theoretical mathematics, the infamous “weed out” course in the math major, I found it challenging to connect with the material.

This experience prompted me to try to forge my own applied statistics path in the major, so I met with the head of the math department. He understood my difficulties with the theoretical nature of the required classes for the major, and told me about how the future of the department included new statistics teachers and maybe a statistics major. But there was no statistics major currently.

The next semester, I decided to switch into a computer science major as a junior. The summer before my junior year, I made sure that I was going to approach my Data Structures and Algorithms course (the “weed out” course in the CS major) much differently than I did when I was a math major. I spent my summer taking notes from the textbook when I rode the subway to and from work, and practicing coding on my computer. I wanted to put myself in the best possible situation to succeed going into the class.

Yet when I started the class, I got failing grades on my first two labs. I did all I could possibly do to prepare for this next step over the summer; what was I missing? Why was I still failing? I met with my teacher and tried to work through the mistakes, setting aside most of my time here at Lafayette just to passing this class.

Eventually, I hit my stride, and started getting passing and above grades on my labs, tests, and projects. I finally felt some sort of relief, that my change in attitude towards school, combined with a new interest in what I was learning was going to get me through this class, and subsequently the computer science major. But one day after class, when I did not expect it, my professor approached me and asked about what my plan was for the next two years. I explained what I have written above; I was driven out of the math major because I wanted to do statistics, and was going to try and squeeze a computer science major into four semesters.

He said that he thought I could still make my own Data Science major. I was ecstatic – Data Science sounds a lot like statistics. Finally, after trying a year ago and giving up, I was going to be able to make my own major and do the discipline I actually wanted to. Then I met the director of Digital Scholarship Services, Charlotte Nunes, who told me there was an academic planning committee for Data Science & Digital Scholarship. I was excited, and realized that I was actually the guinea pig for Data Science at Lafayette College.

This long process was how I got introduced to Digital Scholarship Services in the library, and all of the different intersections between Computer Science, Data Science, and what is being done in DSS. I started then to do research for Charlotte, on topic modeling and different text analyses in R. Matthew Jockers is one of the leading scholars for this buzzworthy subject in the field of digital scholarship. We spent the semester working through Jockers’ how-to book and discussing how he was received by scholars.

This is a long and winding story, but I ended up in Data Science and Digital Scholarship Services for the same reasons I wanted to study statistics in the first place. I was interested in understanding the world around us in ways that were only possible with mathematical and technological methods. And Digital Scholarship Services is building capacity in this area. Topic modeling and Matthew Jockers’ scholarship, for example, is all about trying to discover new aspects to text that would be impossible without technology.

Response from Charlotte Nunes

Ben Gordon’s research project, New York City’s Subways, Bridges, Highways, and Expressways in the 20th Century, offers a great example of how Lafayette College Libraries supports data-oriented undergraduate and faculty research. Ben completed the project under the supervision of Angela Perkins, Research & Instruction Librarian and director of the Lafayette College Libraries Digital Humanities Summer Scholars (DHSS) program. As part of this program, Ben consulted with members of Digital Scholarship Services including John Clark, Data Visualization & GIS Librarian, on geospatial data discovery, research data management, data transformation, and data visualization. Ben currently collaborates with Janna Avon, Digital Initiatives Librarian, and I on a text analysis project featuring oral history transcripts.

In the academic year leading up to his DHSS project, I appointed Ben as a student worker to assist me in clarifying where DSS might build on departmental strengths to better consult on introductory data science methods for analyzing data in the humanities and humanistic social sciences. Together we explored data science as a varied, multidisciplinary field involving data analytics, data visualization, and data ethics. The field requires skills in finding, cleaning, and organizing data, articulating research questions, drawing interpretive conclusions from statistical inference, and communicating persuasively about the results of data analysis.

As Lafayette College advances its Data Science & Digital Scholarship academic planning initiative, I anticipate that DSS will continue to provide wide-ranging data services while growing in new areas, including:

  • Providing consultation and workshop instruction on humanistic uses of R, a statistical computing language that allows for a variety of modeling, clustering, and visualization techniques in text corpora.
  • Building research-ready digital archival collections guided by the principle of Collections as Data, and consulting on data mining in primary source databases such as Adam Matthew Digital.
  • Exploring uses of artificial intelligence and natural language processing for humanistic data analysis, as addressed in the Institute of Museum and Library Services grant Investigating the National Need for Library Based Topic Modeling Discovery Systems.

I thank Ben for his work exploring the field of data science, building skills as a practitioner, and helping to set a vision for the future of DSS. Check out the Lafayette News coverage of Ben’s research titled Uncovering Political History of NYC Subways!

 

Monumental Reflection, 1

“Every dictatorship, whether of man or of party leas to the two forms that schizophrenia loves most: the monologue and the mausoleum. Moscow is full of gagged people and monuments to the Revolution.”

– Octavio Paz

One can tell a lot about a nation not just by what is creates, but also by what it destroys. Despite how controversial Communist monuments in post-Soviet Eastern Europe are, they remain as testimonies of national perseverance and evolution. They have stood as silent witnesses to the peaks and troughs of political regimes. With cold, somber eyes, many of these Communist survivors have observed the dynamically changing social values that have brought about political, historical, and social transitions.

Albeit imbued with state ideology, they are art, built upon the empty thrones of fallen dictators. Memorials to students who stood in the path of tanks, to soldiers who bore the deepest wounds of war, to parties that built national history. The people esteemed as heroes in the eyes of some, yet condemned as malefactors in the eyes of others. They are paved over. Renamed. Blown apart. But sometimes … Sometimes, they survive, and each and every one of them has a unique story to tell the world.

Growing up in Eastern Europe, I watched as monuments were abandoned, ridiculed, destroyed by human aggression and passage of time. But I also saw tourists who stood in awe in front of bigger-than-life sharp figures. I watched as youth furiously scrubbed away the contempt painted upon men who, whether for good or for bad, made history. I gazed at how piece by piece, meaning was brought back to monuments that had been left behind.

These are all reasons why this Digital Humanities project is so meaningful to me. How does society’s perception of ideological monuments change? Throughout the years, what are the values that these monuments have carried? Why are monuments destroyed, and what sparks the desire to bring life back to them? How does the digital world play a role in the treatment of physical world edifices? I will attempt to answer all of these questions in the 6 short weeks that I will be conducting research.


One of my biggest aims is to create a digital record of Communist monuments in post-Soviet Eastern Europe. I foresee that I may not be able to track down all the monuments due to limits of time, but I will try to include at least a dozen monuments from each represented country. After the formal timeline of my research reaches an end, I plan to further work on this, until at one point I have mapped down all the existing monuments. Additionally, I have pondered whether to add photographs of each monument, but since I will face a lot of copyright issues in finding usable photographs, it is not definite whether this will happen.

When I have an initial list of the monuments that I will include in the visual database, I will begin looking for primary and secondary resources, in order to explain the values that they have been imbued with throughout history: reasons for building, history of the depicted figure (if human), treatment of the monument, popularity as a visiting site, and lastly, popularity on the internet. Evidently, not all monuments have been treated in a similar fashion, so I may not be able to track the ever-changing values of every monument in the database. However, I will be ready to compromise and, at least as a base level, create a categorization of all the monuments.

After examining those features, I hope to reach certain conclusions about these monuments, based on the aforementioned questions. For instance – the different values imbued at different periods of time. Additionally, a major part of my monumental examination is focused on the role that the digital world plays in the preservation of a nation’s architecture, culture and history.

Throughout this whole process, I will try to be as flexible as possible due to the research question’s large scale and the consequent time-consuming data collection. The idea of a visual database, including categorization and examination of at least a dozen monuments from each country, is important to me, and therefore I will do my best to adhere to it. However, I am willing to compromise on my ideas regarding everything else.

Hopefully, I will face only minor bumps on the road as I embark on this ideological Eastern European journey!