Professional Summary

  • Data and Information Scientist with over 12 years of experience in enterprise-level and lab-level data management in science, medicine, social science, business, law, and industry.
  • I’m committed to building, documenting, and maintaining processes and systems that support strategic organizational goals, unburden teams, and enable innovation.
  • My passion for information access led me to earn a Master’s Degree in Library and Information Science from the University of Washington, which gives me a unique perspective on curating and stewarding data and digital assets for both access and long-term preservation.

Demonstrated ability to

  • Learn quickly, think creatively, and work collaboratively
  • Manage projects with multiple stakeholders
  • Build and maintain relationships with external stakeholders
  • Communicate technical details to a non-technical audience
  • Develop and implement enterprise-wide data initiatives
  • Curate and publish diverse data and research outputs for complex organizations

Skills and proficiencies include

  • Expertise in data curation best practices, data documentation, data modeling, and database management systems
  • Strong written and verbal communication, attention to detail, analytical thinking, and problem-solving abilities
  • Using Python (pandas, numpy, scikit-learn, keras, tensorflow), R, APIs, SQL, and Tableau to clean, investigate, analyze, and visualize data
  • Master Data Management (MDM) and data governance best practices
  • Digital Asset Management (DAM) tools and best practices
  • Expert knowledge of statistical and related research methodologies (descriptive and inferential, quantitative and qualitative)
  • Familiarity with AWS, Snowflake, Jira, GitHub, Jupyter notebooks, Microsoft Office applications, and other collaborative tools

Experience

Data Science Associate

2022 - 2023
Pitney Bowes

Summary

  • Worked across PB’s many lines of business to store, curate, and provide access to structured and unstructured data and information in corporate repositories in alignment with our data models.
  • Worked with AWS, Snowflake, and SelectStar to build data domains with accompanying data dictionaries.
  • Built taxonomies for classification of data products and information in cooperation with subject matter experts in several domains.
  • Ingested and documented assets in Pitney Bowes’ data catalog, providing a single source of truth for business applications.
  • Analyzed usage of data catalog to determine cross-company team needs, gather feedback, and address issues.
  • Advanced the visibility and discovery of data by developing processes to streamline data collection, curation, and dissemination.

Data & Assessment Librarian

2017 - 2022
Yale Law School

Summary

  • Led data and assessment efforts for the institution, including designing, distributing, and analyzing surveys of the user populations.
  • Built, documented, and visualized datasets from multiple sources using SQL, Python, R, OpenRefine, and Tableau in order to assess institutional data.
  • Created reports and visualizations of data on services and collections to strategically communicate institutional activities to a diverse group of stakeholders.
  • Instructed researchers on the discovery, collection, use, analysis, and management of data, software, programming tools, and other digital assets across the research lifecycle.
  • Consulted one-on-one with researchers on designing, implementing, and completing studies requiring significant statistical or data-related research.

Data Librarian for the Social & Natural Sciences

2011 - 2017
Yale University

Summary

  • Established and coordinated the university-wide Research Data Consultation Group to assist Yale researchers in the sciences and social sciences with data analysis and management at any point in their research.
  • Collaborated with colleagues across Yale’s libraries to architect a digital repository system and write a digital preservation policy for born-digital and archival materials.
  • Created and taught courses on research data management, both in general and for specific disciplines in collaboration with Yale faculty in astrophysics and political science.
  • Collaborated with the Provost’s Office to research, write, and support Yale’s official policy on Research Data and Materials.

Projects

This is a short list of projects I've worked on. A more comprehensive portfolio page is under construction.

Monster Mashup 2015 - Horror Business - Using Open Refine to clean and join data about horror films held in the Yale University Library VHS collection. An online accompaniment to a Halloween-themed data workshop hosted at Yale for several years.
Monster Mashup 2014 - Finding the Devil in Contemporary Music - Using Open Refine to clean and explore data within a dataset of 500,000 songs. An online accompaniment to a Halloween-themed data workshop hosted at Yale for several years.
Monster Mashup 2013 - Witch Trials - Using Open Refine to clean, join, and explore data about witch trials in the US, Europe, and England. An online accompaniment to a Halloween-themed data workshop hosted at Yale for several years.
Research Data @ Yale - I founded and coordinated Yale's Research Data Consultation Group during my time as Data Librarian. Research Data @ Yale was a university-wide group that provideed consultation on best practices, implemented data management services, and linked users to resources for assistance at any point in the data lifecycle. Several of our members also drafted and implemented Yale's official university policy on research data and materials.
Yale Day of Data - I founded and chaired the Yale Day of Data Conference from 2013 - 2017. Day of Data brings together scholars from across academia, government, and industry to discuss the research data landscape at Yale and beyond. Videos of previous presentations are available on the conference website.