Postdoctoral Researcher in AI-Driven Data Curation and Data Integration

University of Pennsylvania

Philadelphia, PA



Faculty Mentor: Joost Wagenaar

Department: Informatics

Number of Positions: 2

Open to applications from US Citizens and foreign nationals.



The Wagenaar Lab is seeking a highly motivated Postdoctoral Researcher to conduct research at the intersection of artificial intelligence, common data elements (CDEs), and large-scale biomedical datasets. The Wagenaar Lab is jointly based in the Institute for Biomedical Informatics and the Department of Biostatistics, Epidemiology, and Informatics at the University of Pennsylvania, and leads the academic development of the Pennsieve scientific data platform. The lab’s mission is to create scalable, sustainable infrastructure that enables data integration, reuse, and discovery across clinical and scientific research domains.



This postdoctoral position will focus on developing AI-enabled methods to automate and augment data curation, with an emphasis on leveraging CDEs to improve the usability, interoperability, and scientific value of public datasets. The successful candidate will work across disease areas, including Epilepsy, Immune Health, and programs within the NIH HEAL Initiative, to design approaches that harmonize heterogeneous datasets, enrich metadata, and support scalable data exploration.



The Postdoctoral Researcher will work closely with the Pennsieve development team and a broad network of scientific collaborators to translate industry best practices in data engineering and AI into the academic research ecosystem. A central goal of this role is to move beyond manual, project-specific curation toward reproducible, automated, and extensible curation workflows that can be applied across datasets, programs, and institutions.



In addition to platform and method development, the Postdoctoral Researcher is expected to contribute to peer-reviewed publications, open-source software, and community-facing resources that advance AI-enabled data stewardship and reuse.



Responsibilities





+ Develop AI-based methods and deploy them at scale to automate and augment data curation using Common Data Elements



+ Design workflows to harmonize, validate, and enrich public datasets across Epilepsy, Immune Health, and NIH HEAL programs



+ Develop novel mechanisms to interrogate, visualize, and interact with complex scientific datasets, increasing their value for the scientific community



+ Integrate curation methods into scalable, cloud-based scientific data platforms



+ Collaborate with the Pennsieve development team and scientific partners to align methods with real research workflows



+ Evaluate and validate curation approaches using large, heterogeneous public datasets



+ Prepare manuscripts, technical documentation, and presentations describing methods and outcomes









Qualifications





+ Ph.D. (preferred) or Master’s degree in Biomedical Informatics, Computer Science, Data Science, Bioinformatics, or a related field



+ Experience with machine learning, natural language processing, or AI applied to structured and unstructured data



+ Familiarity with Common Data Elements, data standards, or ontology-based data representation (preferred)



+ Strong programming skills in Python, Go, Java, or related languages, and experience with containerization tools such as Docker



+ Experience working with large-scale biomedical or clinical datasets



+ Experience with cloud-based data processing and scalable analytics environments (AWS preferred)



+ Strong written and verbal communication skills and an interest in interdisciplinary collaboration





Please include a cover letter and a CV for consideration.




The University of Pennsylvania is an equal opportunity employer. Candidates are considered for employment without regard to race, color, sex, sexual orientation, religion, creed, national origin (including shared ancestry or ethnic characteristics), citizenship status, age, disability, veteran status or any class protected under applicable federal, state, or local law.

About the Company

University of Pennsylvania

Penn's beautiful urban campus in Philadelphia provides easy access to a range of educational, cultural, and recreational activities. Penn is a diverse, multicultural learning community at the cutting edge of research and information technology. Employee benefits include excellent healthcare and tuition benefits for you and your family, a competitive retirement program, health promotion and wellness services, professional development opportunities, and flexible work options.

Penn is frequently cited as an outstanding employer:

  • Recipient of the 2010 Terri Lynne Lokoff Child Care Foundation Corporate Leadership Award for commitment to supporting work-life balance programs
  • Selected as a recipient of the 2010 Healthy Workplace Award by the Philadelphia Business Journal and presenting sponsor UnitedHealthcare
  • Recognized as a 2010 Top Workplace by The Philadelphia Inquirer/Daily News
  • Recipient of the 2009 Vision Award for commitment to workforce diversity and economic inclusion
  • Selected for Computerworld's 100 Best Places to Work in IT six years in a row
  • Recognized by the Delaware Valley Association for the Education of Young Children as a 2006 Best Employer for Working Parents
  • Named a Best Place to Work in the November 2007 issue of Philadelphia Magazine
  • Designated a Best Workplace for Commuters by the National Center for Transit Research