Lead Chemistry Database Curator
India | Remote
Location: India - Remote, IN
Job ID: JR-033608
Lead Chemistry Data Curator
Job Description:
Direct content development for the ChemACX database. Lead the team in curating all available chemical information from vendors’ data into machine-readable formats and loading it into a central database. Maintain quality control. Improve existing curation processes through automation.
Responsibilities:
Seek out new vendors for inclusion to increase the breadth of coverage of the database. Keep in touch with existing vendors regarding updates to their catalogs. Correspond with users regarding errors and potential improvements in the data. Assign catalogs to curators based on priority. Formulate and direct projects for improvement of existing data.
Extract vendors’ data from different file formats (SDF, CSV, Excel, and plain text) and convert it to the standard format required for loading into a central database. This includes cleaning the data, converting it from one representation to another (e.g., name to structure and vice versa), and generating 2D structure data in SDfiles for loading. Report bugs found in internal tools as necessary.
Run weekly and quarterly checks on data to maintain quality control. Address issues as necessary. Maintain curation metrics for reporting to management.
Record the above activities in JIRA and Confluence.
Examine all steps of the existing process, identify areas where automation can be used, implement the solutions, and devise automated tests.
Qualifications and Experience:
- Graduate degree in chemistry, preferably organic chemistry
- Knowledge of chemical vendors worldwide
Required Skills:
- Experience working with SQL-based databases, including strong knowledge of using scripts to extract, modify, and load data.
- Proficiency in verbal and written communication.
- Excellent interpersonal skills.
Good to Have:
- Experience using ChemDraw/ChemOffice.
- Knowledge of IUPAC nomenclature and SDfile format.
- Experience handling chemical databases with very large datasets.
- Experience with a scripting language like Python.
- Knowledge of agile methodologies.
PerkinElmer is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability status, age, veteran status, or any other characteristic protected by applicable law. PerkinElmer is committed to a culturally diverse workforce.