Data Visualization

Throughout my graduate career, I've realized that I quite enjoy creating data visuals and using them as another tool to tell stories about complex scientific concepts in a more approachable way. I've honed my skills by creating figures for my research papers, posters, and coursework, and by working on personal projects. Take a look at some of my work below!

Interactive Local Volume Database

python

Streamlit

Altair

While working on a research project on ultra-faint dwarf galaxies, I found myself frequently using Andrew Pace's Local Volume Database (LVD) on GitHub to look at properties of nearby dwarf galaxies and star clusters. Users can access the raw data through a spreadsheet program or read it into a programming language like Python, which was wonderful for making publication-quality figures, but it was hard to simply explore the data. Answering questions such as "how does the metallicity of a certain galaxy compare with other dwarfs in a specific luminosity range?" or "how do Milky Way dwarfs compare to Andromeda dwarfs in terms of size?" requires data wrangling and coding.
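To give a sense of the wrangling that even a simple question like the first one involves, here is a minimal Python sketch. The records, the field names (`name`, `host`, `M_V`, `feh`), and all of the values are made-up placeholders for illustration, not actual LVD columns or measurements, and the helper `compare_metallicity` is hypothetical.

```python
from statistics import median

# Hypothetical records: names, field names, and values are placeholders,
# not actual LVD columns or measurements.
systems = [
    {"name": "dwarf_a", "host": "mw", "M_V": -4.0, "feh": -2.5},
    {"name": "dwarf_b", "host": "mw", "M_V": -6.0, "feh": -2.2},
    {"name": "dwarf_c", "host": "m31", "M_V": -7.5, "feh": -2.0},
    {"name": "dwarf_d", "host": "m31", "M_V": -11.0, "feh": -1.5},
]


def compare_metallicity(systems, target_name, mag_bright, mag_faint):
    """Return the target's metallicity minus the median metallicity of the
    other systems whose absolute magnitude lies in [mag_bright, mag_faint]."""
    target = next(s for s in systems if s["name"] == target_name)
    peers = [
        s["feh"]
        for s in systems
        if s["name"] != target_name and mag_bright <= s["M_V"] <= mag_faint
    ]
    if not peers:
        raise ValueError("no comparison systems in the requested range")
    return target["feh"] - median(peers)


# Absolute magnitudes are brighter when more negative, so the bright
# limit is the smaller number.
offset = compare_metallicity(systems, "dwarf_b", mag_bright=-8.0, mag_faint=-3.0)
print(f"dwarf_b offset from peer median: {offset:+.2f} dex")
```

Every new question means writing another small function like this one, which is the friction an interactive tool can remove.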

As a side project, I created an interactive web app that allows users to explore trends and build intuition about local dwarf and globular cluster properties, as collated in the LVD. I wanted to have an easier way to just explore the data, look at relationships between different properties, and look at individual objects in more detail. Users can quickly make scatter plots of all the different system properties (mass, metallicity, half-light radius, etc.) and filter by the host galaxy and other system properties. You can pan and zoom around the plot, hover over data points to see the selected properties of the system, change the colors of the points, and show the properties and raw data for specific systems.

The interactive LVD is still in active development, and I plan to add more features, such as histograms of a specific system property and links to the relevant literature for each property and object. Any comments, suggestions, or bug reports are much appreciated and can be submitted on the project's GitHub repository!

Explore the database!

SI 649: Information Visualization

In the Winter 2024 semester, I made my first formal foray into data visualization by taking a course on Information Visualization from the University of Michigan's School of Information. We completed three mini-projects for the class, each focusing on a different type of data visualization.

Static Visualization

Flourish

python

Adobe Illustrator

Our first project required us to create three static visualizations to accompany a NYT news article about the migrant crisis in the US. The purpose of these visuals was to add context and detail to the article, which cites a lot of statistics but does not include any original visuals to go with them. I chose to focus on making visuals (pictured right) showcasing why asylum seekers are leaving, how their countries of origin and the US states they've been going to have changed over time, and how overwhelmed the US immigration system is.

Click below to see the full-sized visuals, the original article and my annotations, and the write-up of my thought process in making these visuals.

See the visuals!

Static visualizations for NYT migrant crisis article

Scientific Visualization

python

Altair

Panel

Our second project challenged us to take an existing static visualization from a scientific paper and recreate it as an interactive version that supports the reader in understanding the data. Though we were given several papers to choose from, I chose to find my own paper and dataset, and recreated a figure from a well-cited review paper we had discussed at length in my graduate galaxies class: “Physical Properties and Environments of Nearby Galaxies” by Michael Blanton and John Moustakas.

I combined Figures 8 and 12 from the paper into an interactive visualization that allows users to explore the relationships between galaxy properties such as stellar mass, color, Sérsic index, and size in the context of different galaxy morphologies (pictured right). Hovering over a point in one of the scatter plots highlights that galaxy in the other plots and displays the relevant characteristics in a tooltip, while clicking on the point brings up an image of that galaxy. The colors of each morphological type (e.g., E, S0, Irr) are also customizable, and one can select individual types to focus on.

This was my first time making an interactive visualization, and this project inspired me and gave me some of the tools and skills I needed to create the interactive Local Volume Database web app later on. In the future, I hope to go back and revamp this project to make it look nicer, add more features, and make it more user-friendly.

Check out the viz! Read the writeup!

Interactive scientific visualization project

Narrative Visualization

Flourish

python

Our last project combined the skills we learned in the previous two projects and required us to create a narrative visualization that tells a data-driven story, combining text with integrated interactive visuals. We had the freedom to choose any topic, so I wanted to create a narrative about space that was more human-centric. I decided to investigate the namesakes of planetary and lunar features in our Solar System, and how they reflect some of the values and biases of the people who named them.

I compiled and hand-cleaned data from the United States Geological Survey's (USGS) Gazetteer of Planetary Nomenclature, which contains the names and descriptions of all named features on planets and moons in our Solar System. This narrative explores the genders, professions, and nationalities of the individuals who have been honored with namesakes in our Solar System, as well as general naming conventions as set by the International Astronomical Union (IAU).

Read the narrative!

Narrative visualization project page showing a scatter plot

ELVES-HST Survey Graphic

python

Adobe Illustrator

I am currently part of a research team using the Hubble Space Telescope to determine distances to 103 low-luminosity dwarf galaxy candidates around 21 galaxies in the Local Volume. This survey is still ongoing, and I presented this project at the Galactic Frontiers II conference in July 2025 at Dartmouth College. As part of my poster, I created a graphic to illustrate the current status of the survey (pictured below). It shows the locations of the 21 host galaxies on a stereographic projection, with pie charts showing the fraction of candidate satellite galaxies around each host galaxy that have already been imaged and confirmed by our team, have been rejected as background objects, are uncertain and require more investigation, or have not yet been observed. The size of each pie chart is proportional to the number of candidate satellites around that host galaxy.

A stereographic projection showing the band of the Milky Way in purple and the locations of 21 host galaxies in the Local Volume, with pie charts showing the status of the ELVES-HST survey around each host galaxy
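As a rough illustration of the pie sizing described above, here is a minimal Python sketch. It assumes that "size" means area, so the radius grows with the square root of the candidate count; this is one common choice and not necessarily the scaling used in the actual graphic. The function names, status labels, and counts are hypothetical placeholders, not survey results.

```python
import math


def pie_radius(n_candidates, n_max, r_max=1.0):
    """Radius of a pie whose AREA scales linearly with n_candidates.

    n_max is the largest candidate count among all hosts; that host's
    pie is drawn with radius r_max.
    """
    if n_candidates < 0 or n_max <= 0:
        raise ValueError("counts must be non-negative and n_max positive")
    return r_max * math.sqrt(n_candidates / n_max)


def status_angles(counts):
    """Convert per-status candidate counts into wedge angles in degrees."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("host has no candidates")
    return {status: 360.0 * n / total for status, n in counts.items()}


# Placeholder counts for a single hypothetical host (not real survey data).
host_counts = {"confirmed": 3, "rejected": 1, "uncertain": 2, "not_observed": 2}

radius = pie_radius(sum(host_counts.values()), n_max=32)
angles = status_angles(host_counts)
print(radius, angles)
```

Scaling the area rather than the radius keeps a host with twice as many candidates from looking four times as large.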

GLAS Messier Marathon

python

Adobe Illustrator

A popular activity in the amateur astronomy community is the Messier Marathon, where participants try to observe all 110 Messier objects in one night. The Messier catalog is a list of astronomical objects, including nebulae, star clusters, and galaxies, that the French astronomer Charles Messier compiled in the 18th century as objects that astronomers should ignore while searching for comets. Inadvertently, the Messier catalog has become a list of some of the most popular objects for amateur astronomers to observe, given that many are easily observed from the Northern Hemisphere. Messier Marathons are often held in the spring (March/April) around a new moon, when it is possible to observe all 110 objects in one night.

In 2022, GLAS Education and Yerkes Observatory hosted a Messier Marathon star party and fundraiser, and I created a poster showcasing all the objects that were observed and imaged during the event. Each object image has a border around it indicating what type of object it is (galaxy, nebula, star cluster, etc.), and the color of the border corresponds to the constellation the object is found in. The poster also includes diagrams of all the constellations that host Messier objects and where each object is located with respect to the constellation. Stars in each constellation are colored by their temperatures.

Poster showing 57 circles with images of Messier objects, bordered by diagrams of the constellations that host Messier objects