Digital Humanities Tools for Beginners and Non-coders

Dear beginners to Digital Humanities and non-coder academics,

I have some good news for you: you can begin your digital humanities project and explore different tools before you learn to code or not learn to code at all. As scholars, we continue to learn and decide which digital skills are best suited for our research and projects. We may learn a skill and later decide our efforts are best spent honing a different digital skill or methodology. The most common coding languages we, the GCDFs, use and teach are R and Python. But, until you decide to learn either, both, or another coding language, there are tools you can use to execute your digital research and projects. 

A tool that many of us have encountered and that we might not necessarily think of as a DH tool is a spreadsheet softwares in which data is arranged in rows and columns and can be used to make calculations or re-organized to reveal patterns. Spreadsheets are a great tool to store, organize, clean, analyze, and even create simple visuals of data points. They are a helpful beginner tool that can assist you in deciding if you need to use more dynamic systems like databases or write code to perform more complex analysis and synthesis of your data. 

A tool that utilizes the simple spreadsheet software is Knight Labs Timeline. TimelineJS is an open- source tool that allows you to visualize your data into an interactive timeline to include text, maps, images, and audio! An example of a project that uses Kinght Labs’ Timeline is Jenna Queenan’s 20 Years of History: New York Collective of Radical Educators

Another way spreadsheets can be used is to analyze text. Both Google Sheets and Microsoft Excel feature “Analysis Content” add-ons to allow users to conduct sentiment analysis or topic detection. However, what if you have a machine readable text and you want to use it as a corpus to conduct a text analysis? (A machine readable text: an image, handwritten, or printed text encoded into a digital data format to be machine recognizable. Think of a document where you can highlight individual characters as opposed to a document where you are unable to highlight a single character and the entirety of the document is highlighted because the machine recognizes it as one large character. Or think of those scholarly articles that can be read to you by your text-to -speech application. Both being able to highlight individual characters in a document and having the document read to you indicates that the text is machine readable.) So, if you have a text formatted in plain text, HTML, XML, PDF, RTF, or MS Word, even if it is in a Language Other than English, you can use an open-source tool called Voyant-Tools to upload your corpus or corpora and conduct the text analysis. Voyant-Tools can also assist in widely-reading (or distant reading) a text or formulating research questions

The benefit of both TimelineJS and Voyant-Tools is that they allow you to either simply use them as is or expand the scope of your project further you become a coder. These tools are examples of open-source, web based, non-coder tools that allow for both beginner DHers and non-coders to gain access to DH strategies and methodologies while avoiding the cost of non-coder proprietary tools. For more DH tool options, both for advanced users and beginners, check out University of Toronto’s Find Digital Scholarship Tools website. 

For more on DH methodologies and applications, go to our blog and check out our catalog of  events

Best wishes, 

Your fellow (learning to code) DHer.

students working on laptops at same table

Call for Proposals: Provost’s Digital Innovation Grants 2019-2020

Applications for Provost’s Digital Innovation Grants – also known as PDIGs – are open! If you’re working on a digital project, planning to, or hoping to attend a short course or workshop to learn a skill that may support a current or future digital project, then this grant opportunity is for you!

PDIGs are broken out into 3 tiers – Training, Start-up, and Implementation grants – all of what are during by 5:00 PM on Friday, October 18, 2019

Learn more about this Call for Proposals: Provost’s Digital Innovation Grants 2019-2020 and review past projects on our digital grant website.

Reflections on the inaugural “Conversations in Digital Scholarship”

This post is collectively written by the GCDI fellows: Tuka Al-Sahlani, Silvia Rivera Alfaro, Parisa Setayesh, Maggie Schreiner, Anna Corbett , Zachary Lloyd, Cen Liu, Maura McCreight, Cortnie Belser, Peyton Cordero, and Chen Zhou.

When we, the digital fellows, set out to create an event to build community, skill share, and learn from other digital humanists, we didn’t know what it would be or what it would look like, we just knew we wanted to have conversations with other GC DH members. After many internal conversations and planning sessions, we decided that we didn’t need to create an elaborate event, we just needed others at a table to talk and share. We knew from our virtual drop-in hours, consultations, and workshops that there were other GC DH folks, beside us, who wanted and sought out a space to brainstorm, work through challenges, and simply share among a larger audience. We had our own questions and challenges and it was time we, the DH community at large, had a space and time to do just that–converse! 

On May 13th, GCDI fellows held the “Conversations in Digital Scholarship”in the C level with five topical roundtables: “AI for Qualitative Research,” “Digital Archives,” “ Education Game Design,” “Open Pedagogy on Manifold,” and “Working with Open Government Data.” After the official conversations, we all gathered in the Masters for DH lounge to share what we learned and discussed in our respective roundtables. Below you can read some of our reflections from each roundtable. 

AI for Qualitative Research

Artificial intelligence is a relatively new technology, thus, our conversation started with personal experiences that allowed us to situate ourselves as scholars who work in research and teaching. First, we centered on AI as a tool relating the reflections with similar moments when new technological tools or spaces were framed as dangerous, such as Photography or Wikipedia. An interesting point that emerged was how metacognition plays an important role in keeping our agency while using AI as a tool: thinking about how we think (and our processes and activities) makes it possible to use AI to simplify our work while maintaining the possibilities of moving forward with our work. (We compared this to the idea of using AI to be able to solve tasks, but in ways that we might not learn new skills, which goes into a possible discussion on labor issues). For research, we brought concerns on ethics and data practices, such as how to control protocols of data collection (e.g., confidentiality), where the information is stored, and our possibilities to choose not to feed the model with our research data. Another issue is questions on what data a model was trained with (can we have the information? How could that affect our research?) and the ethics behind the gathering of that data (was it publicly available information?). We moved to a question on examples of models where AI has been used in qualitative research, and we saw that several tools can do different parts of a process (e.g., using AI to get information, transcribe, do analysis, etc) and discussed how the different uses have diverse ethical implications. 

Digital Archives

In the “Digital Archives” roundtable we learned that some archivists use vibe coding to help them learn and work with museum APIs. In turn, we grappled with the archivist’s need to collect and record everything and then what to do with all that data! We also discussed the challenge of  defining identifiers and best practices when dealing with people as our archive “object.” We asked: do we ask for permission or do we use the opt-out method?  Another challenge we addressed is the labor in digitizing physical archives scattered across institutions and locations. We were privileged to be joined by Filipa Calado (former Digital Fellow) who gave us a quick workshop on how to scrape images from websites. All in all, we achieved our goal of skill sharing and building a community to return to and discuss with and collaborate with in the future. 

Educational Game Design

The Game Design roundtable was attended by gamers and non-gamers alike, each with compelling questions and ideas about incorporating game-based learning into their classrooms and their research. We discussed what makes a game compelling for students for the right reasons–how do we ensure that learning goals are baked into the game objectives and not just tacked on top? This led to a conversation on choosing the right game mechanic for the learning goal. Narrative-based, card driven, cooperative, and competitive games all have their place in pedagogy, and it’s important to build a game with a mechanic that coordinates well with what’s being taught. We shared examples of games we found inspiring or compellingly frustrating, and brainstormed solutions to in-progress work.  

 Open Pedagogy on Manifold

During the roundtable, participants explored how digital publishing can advance open pedagogy, collaborative scholarship, and creative teaching practices. Attendees shared their own experiments and interests in the platform, ranging from open textbooks and multimodal annotation to projects that emphasize oral communication and offer alternatives to proprietary systems like Blackboard.

The discussion covered project examples that demonstrate Manifold’s versatility as both a teaching tool and a platform for public scholarship. Participants shared inspiring projects on Manifold, such as projects that reimagine the textbook as a dynamic, participatory medium. Notable examples included We Eat, which frames food writing as community knowledge-making, and Poetry/Speech Database, which integrates text, sound, and metadata to support multimodal expression. Additional resources like Open Anthology of Earlier American Literature, Writing About Art and Digital Pedagogy in the Humanities offered further examples of inclusive, participatory learning approaches.

Our conversation also underscored a crucial tension: while faculty recognize the potentials of digital publishing, they encounter practical barriers such as institutional skepticism, limited training, and time constraints. The mention of departmental pushback and assessment concerns reflects deeper structural issues within academia that often prioritize traditional publishing models over digital scholarship. Participants concluded that with adequate institutional support, platforms like Manifold can foster more equitable, collaborative, and student-centered scholarship.

Working with Open Government Data 

With experts across a variety of subject areas, we explored the implications of the current administration on the access and quality of open government data. While it is not uncommon for certain government pages and initiatives to be removed when new administrations take office, both “the scale and speed of the Trump administration’s data manipulation–combined with buyouts, resignations, and other restructuring across government agencies–signal[s] a new phase in the war on public information.” The group discussed how the disappearance of previously-public data has impacted their own research, shared various data recovery efforts that have been taken up by universities and other NGOs, and brainstormed ways that we could keep these conversations going into the future, including suggesting some new potential GCDI offerings! We benefited greatly from the participation of Steve Romalewski, Director of the CUNY Mapping Service, who has a wealth of experience working with open government data and was able to share some helpful considerations and best practices with the rest of the group. This conversation was generative, insightful, and–in the opinion of all of our participants–incredibly necessary!

Latinidad in the making: DH projects and collective futurities

This virtual panel connects DH scholars whose projects recover the history of centuries of oppression in the US territory and who do community-engaged research with diverse Latinx/Hispanic and Afro Latinx communities and who respond to the present lived by these communities. The event is a co-sponsored event by GC Digital Initiatives, the Ph.D. Program in Latin American, Iberian, and Latino Cultures, and the MA Program in Digital Humanities

Panelists:
Dr. Gabriela Baeza Ventura, University of Houston
Dr. Clayton McCarl, University of North Florida
Dr. Lizbeth De La Cruz Santana, Baruch College
Dr. Eduard Arriaga-Arango, Clark University

 

To RSVP for the event, visit this link.

Digital Methods in Languages Other Than English (LOTE)

This is not a workshop. This is a skillshare session. Building on the GCDI Conversations in Digital Scholarship, during these 90 minutes, we will share, critique, and build a community as we discuss some tools we have been using or learning to use to conduct digital scholarship in languages other than English. Participants do not have to prepare any material, but are expected to share the tool and its usage with the other attendees. The facilitator will share one tool. Or, if you are curious to learn about digital methods in LOTE, feel free to attend and bring your questions!

RSVP for this skillshare session will open on the first day of the Fall 2025 semester.

Introduction to QGIS

Are you interested in exploring ways to visualize data? In this intro-level workshop to mapping, you will take your first steps toward creating your own maps and visualizing spatial data. You will be familiarized with foundational mapping concepts and terms and practice combining spatial data into a map using QGIS, an open-source spatial analytics and visualization software. No prior experience is necessary.

RSVP for workshops will open on the first day of the Fall 2025 semester.

Introduction to Python

How do you do Digital Humanities? How can you determine what tools or methods work best for your project idea? With the remarkable array of digital tools available, it can be difficult to determine which ones you’ll really need. This workshop will give an overview of digital humanities methods and introduce a variety of tools for different kinds of research. We’ll discuss the genres and vocabulary of DH from text analysis to geospatial mapping and review the many available resources online and at The Graduate Center. Feel free to come with your project ideas and questions.

RSVP for workshops will open on the first day of the Fall 2025 semester.

Tools of DH

How do you do Digital Humanities? How can you determine what tools or methods work best for your project idea? With the remarkable array of digital tools available, it can be difficult to determine which ones you’ll really need. This workshop will give an overview of digital humanities methods and introduce a variety of tools for different kinds of research. We’ll discuss the genres and vocabulary of DH from text analysis to geospatial mapping and review the many available resources online and at The Graduate Center. Feel free to come with your project ideas and questions.

RSVP for the workshop will open on the first day of the Fall 2025 semester.

Image of Queer illder ancestor Kiyoshi Kuromiya of ACT UP Philadelphia starts to hurl himself onto brick wall outside the U.S. Trade Representative’s Office in 1999 at his final civil disobedience action after a life of justice struggles, at a protest for global HIV treatment access.

Finding Tools for Queer “Illder” Activist History at the Digital Research Institute

 

This post is written by guest contributor JD Davids, Winter 2025 DRI Participant. Connect with JD Davids on BlueSky: @crankyqueer.org; LinkedIn: @JD Davids; and Instagram @thecrankyqueer

There’s so much I appreciate about being a queer and trans activist rooted in the HIV/AIDS epidemic. For one, it allowed me to have a digital life far before the norm – starting with a dialup account in 1991 or so, thanks to fellow ACT UP Philadelphia member, lifelong social justice activist and tech-for-justice innovator Kiyoshi Kuromiya, founder of the Critical Path AIDS Project

I’d use the terminal shell of the computer – the classic black screen with text-only commands – to enter commands for “telnet,” a network protocol that allowed my computer to log onto others as part of the same network and send email. 

Access to telnet allowed us to talk to other HIV activists around the world, so we could organize for access to medication, challenge exorbitant drug prices, and share lifesaving information on HIV prevention, research opportunities and promising therapies. 

Fast forward to 2025, and I’m pulling up the terminal shell for the first time in decades… 

But this time,  the terminal shell is a part of the whirlwind, mind-opening Digital Research Institute (DRI) at CUNY Graduate Center, via Zoom. I’d come to the Biography and Memoir program at the Grad Center in the fall, eager to share experiences and struggles as an activist and disabled “illder” – someone who has learned much by being sick with immune-depleting and autoimmune illnesses, now including Long COVID. Chronic illness has taught me a lot – but also hasn’t left much time for keeping apace with technology.

The DRI gave me exactly what I wanted and needed as a returning student – a comprehendible and remarkably comprehensive overview of the potentials of digital research today. I thought of Kiyoshi often in that four-day stretch, imagining his delight at the current state of coding, mapping, web design and other aspects of computing. 

I didn’t expect I would become an expert. But, for example, I now have a much better understanding of the potential of Python for text analysis. This could help me as I unearth and seek to analyze archival documents showing early collaboration in ACT UP between people with HIV (mostly gay men) and people struggling for recognition of chronic fatigue (mostly lesbians), which can offer vital lessons for our struggles in this time of rampant Long COVID and attacks on LGBTQ and HIV research.

I’ll be honest: I didn’t emerge from the DRI with all the skills to create that code myself. But I can twin what I’ve learned at the DRI with the skills and principles that Kiyoshi and other activists instilled in me, that will allow me to move the project forward:

  • Know it is possible: I was able to contribute to efforts to bring HIV medication to tens of millions of people worldwide who had been written off by capitalist Pharma and governments alike. We knew it could happen if there was political will, and we organized worldwide to insist that it happen (and need to do so again due to the cruel policies of the current United States leadership that’s cutting off access to meds for people worldwide.)

Now, after my time in the DRI, I know it is possible to use digital tools to help make meaning of the archives of HIV history that have vital lessons for today and the future. It’s a much smaller scale, sure, but it can happen!

  • We make it work by working together: We never have to fight or figure it out alone. We can and must make the changes we need by bringing each of our skills and passions to reach shared goals. 

I may not have a fluency with coding or other skills I learned more about with the DRI. But I know there’s people at the Grad Center – including the fantastic digital fellows I met at the DRI, who could help me figure out the coding. 

Through the years, as I’ve trained a lot of people as activists, I’ve come to believe that there’s one central practice that is vital – knowing ourselves. We combine our strengths to make the changes we need. The DRI was a wonderful way to see people doing just that – supporting one another to grapple with digital technology to further our important work. 

It’s not about becoming an instant tech expert, though I am sure the training has inspired many to go on to deepen and expand their skills. It’s an opportunity to build our collective capacity to understand and make the most of digital research – and I think Kiyoshi would have loved it too. 

Reordering Graphs in ggplot Plot: A Step-by-Step Guide

When we create a plot using ggplot() in R, the order of the categories on the x-axis is set by default. However, for better clarity and aesthetics, we often want to reorder the plot based on the values being shown. This tutorial walks through how to reorder bars in a bar plot using a hypothetical dataset of student scores.

Prepare the dataset

Let’s start by loading the required library and creating our dataset. We’ll use the tidyverse package, which is useful for both data wrangling and plotting.

library(tidyverse)

student_scores <- data.frame( 
Student_ID = 1:11, 
English = c(85, 78, 92, 67, 88, 76, 95, 80, 72, 90, 100), 
Biology = c(95, 87, 90, 79, 94, 96, 93, 82, 89, 97, 105), 
Maths = c(90, 82, 58, 74, 89, 91, 88, 77, 84, 92, 100), 
Physics = c(78, 85, 89, 80, 90, 76, 83, 91, 87, 79, 100))

# Take a look at what the spreadsheet looks like
view(student_scores)

The spreadsheet includes scores for 11 students across four subjects: English, Biology, Maths, and Physics.

Calculate Mean Scores for Each Course

Our goal is to create a bar plot that shows the average score for each course. To do this, we first need to reshape the dataset into a long format with two columns: one for the course name and one for the score.

# reshape data
scores_reshape <- student_scores %>% 
pivot_longer( cols = English : Physics, 
              names_to = "courses", 
              values_to = "scores")

# get the data set that has the mean score for each subject

Next, we calculate the mean score for each course using group_by() and summarize():

mean <- scores_reshape %>% 
  group_by(courses) %>% 
  summarize(ave = mean(scores))

Now we have a clean dataset with the average scores:

Bar Plot with Default Order

Let’s plot the mean scores using a basic ggplot() bar plot:

ggplot(mean, aes(x = courses, y = ave, fill = courses)) +
  geom_bar(stat = "identity") + 
  theme_classic() + 
  labs(x = "Courses", y = "Mean Scores")

This plot uses the default alphabetic order of the course names, which may not reflect the order of the values (scores).

What if we want to order the bars based on the order of the values?

Reordering Bars Based on Values

To reorder the bars so that they appear from the lowest to the highest mean score, we can sort the data and prepend row numbers to the course names. This forces ggplot() to follow our desired order:

mean_ordered <- mean %>% 
  arrange(ave) %>% # order the scores from lowest to highest
  mutate(courses = paste0(row_number(), "_", courses))

The courses column now looks like this:

Now we plot the reordered data:

ggplot(mean_ordered, aes(x = courses, y = ave, fill = courses))+ 
  geom_bar(stat = "identity")+ 
  theme_classic()+ 
  labs(x = "Courses", y = "Mean Scores")

This shows the courses in the desired order, but the course names are now prefixed with numbers:

Cleaning Up the Axis Labels

To display clean course names on the x-axis while keeping the desired order, we can use scale_x_discrete() to relabel the courses on the x-axis and scale_fill_discrete() to relabel the courses on the legend:

ggplot(mean_ordered, aes(x = courses, y = ave, fill = courses)) + 
  geom_bar(stat = "identity") + 
  scale_x_discrete( 
    labels = c(
       "1_English" = "English", # rename the name of each course on the x-axis 
       "2_Maths" = "Maths", 
       "3_Physics" = "Physics", 
       "4_Biology" = "Biology" ))+
   scale_fill_discrete( 
     labels = c( 
        "1_Maths" = "Maths",
        "2_Biology" = "Biology", 
        "3_Physics" = "Physics", 
        "4_English" = "English" ) 
  theme_classic()+ 
  labs(x = "Courses", y = "Mean Scores")

And we successfully get the bar plot of scores for the four courses ordered from the lowest to the highest.

Now you have a bar plot with the courses ordered from the lowest to the highest average score—much more informative and visually appealing!

GC Digital Showcase

The GC Digital Showcase is an annual event featuring interdisciplinary digital scholarship produced by graduate students, digital fellows, working groups, grant recipients, classes, and programs across the CUNY Graduate Center.