OMIM Slurp
- Navigate to src/ontology
- Create a new branch
- In terminal:
sh run.sh make omim_slurp -B
- Two new files will be created in src/ontology/tmp:
- ps_slurp_omim.obo
- rest_slurp_omim.obo
- Open these files and examine them closely for any issues. For example, check that labels and synonyms are appropriately capitalized.
- For OMIMPS:
- Check whether a grouping class for the OMIMPS already exists in Mondo. (There may already be a grouping class from Orphanet, in which case just add the OMIMPS as an additional dbxref.) To check, look up the ID that follows the PS prefix: for example, if the OMIMPS is OMIMPS:131760, check whether OMIM:131760 already exists in Mondo.
- If a prototype term exists, this term needs to be split. (A prototype term is the original term in OMIM: another gene was identified later, the original term became type 1, other types such as type 2 were created, and then an OMIM Phenotypic Series (OMIMPS) was created.)
- Note: if you add a new OMIMPS, the children will not be added automatically; they need to be added manually. (Some child terms may not be in Mondo yet and will be added in a future slurp.)
- New OMIMPS terms can be added directly to the mondo-edit.obo text file. Note that they may not have a superclass; they need a subclassOf axiom such as the following (or you will get a QC error):
is_a: MONDO:0003847 {source="OMIMPS:XXXXXX"} ! Mendelian disease
relationship: has_characteristic MONDO:0021152 {source="OMIMPS:XXXXXX"} ! inherited
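The OMIMPS check above (take the number after the PS prefix and look for the matching OMIM xref) can be sketched in Python. This is a minimal illustration, not part of the Mondo tooling: the helper names `omim_id_for_series` and `terms_with_xref` are hypothetical, the MONDO IDs in the sample are placeholders, and the sketch assumes stanzas in the OBO text are separated by blank lines.

```python
import re


def omim_id_for_series(omimps_id: str) -> str:
    """Map an OMIMPS CURIE such as 'OMIMPS:131760' to 'OMIM:131760'."""
    prefix, _, local = omimps_id.partition(":")
    if prefix != "OMIMPS" or not local.isdigit():
        raise ValueError(f"not an OMIMPS identifier: {omimps_id!r}")
    return f"OMIM:{local}"


def terms_with_xref(obo_text: str, curie: str) -> list[str]:
    """Return the ids of stanzas that carry an xref line for `curie`."""
    # The lookahead stops OMIM:131760 from also matching OMIM:1317601.
    xref_re = re.compile(r"^xref: " + re.escape(curie) + r"(?=\s|$)", re.MULTILINE)
    id_re = re.compile(r"^id: (\S+)", re.MULTILINE)
    hits = []
    # Assumption: stanzas are separated by one or more blank lines.
    for stanza in re.split(r"\n\s*\n", obo_text):
        if xref_re.search(stanza):
            match = id_re.search(stanza)
            if match:
                hits.append(match.group(1))
    return hits


# Placeholder stanzas; the MONDO ids are made up for illustration.
sample = """[Term]
id: MONDO:9000001
name: placeholder disease
xref: OMIM:131760 {source="MONDO:equivalent"}

[Term]
id: MONDO:9000002
name: another placeholder
xref: OMIM:1317601
"""

omim_id = omim_id_for_series("OMIMPS:131760")
print(omim_id, terms_with_xref(sample, omim_id))
```

In practice the text would be read from the mondo-edit.obo file rather than from an inline sample. A hit means a candidate grouping or prototype term already exists and should be reviewed by hand before anything is added.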
- For rest of OMIM:
- Check the terms to make sure they do not already exist in Mondo. (Search for OMIM:XXXXXX in mondo-edit.obo text file.)
- If the term does already exist in Mondo, verify the term is equivalent by checking the OMIM record online, then add the OMIM dbxref and MONDO:equivalent to the Mondo class.
- Check the capitalization of each new Mondo label and fix if needed.
- Synonyms should be exact; fix if needed. (Search for RELATED and replace with EXACT in the text file.)
- Add abbreviations to the abbreviation synonyms:
- turn on regex buttons in Atom (.* and Aa)
- Find:
^(synonym: "[A-Z0-9]+["][ ][A-Z]+[ ])\[
- Replace:
$1ABBREVIATION [
- Note: test a few of these before doing a global find and replace.
- Manually add these terms to mondo-edit.obo by copying and pasting the lines into the file in a text editor such as Atom.
- Re-run step 3 (sh run.sh make omim_slurp -B) until both files are empty.
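The two synonym clean-up steps above (RELATED to EXACT, then tagging abbreviations) can also be tried outside the editor. The sketch below applies the same abbreviation pattern with Python's `re` module; the function name `fix_synonyms`, the pattern used for the RELATED step, and the sample stanza (including its IDs) are assumptions for illustration only. As with the editor workflow, the output should be reviewed before pasting anything into mondo-edit.obo.

```python
import re

# Same pattern as the editor "Find" field: an all-caps/digit synonym,
# followed by its scope (e.g. EXACT), followed by the opening bracket.
ABBREV_RE = re.compile(r'^(synonym: "[A-Z0-9]+["][ ][A-Z]+[ ])\[', re.MULTILINE)

# Assumed pattern for the RELATED -> EXACT step: only touches the scope
# keyword that follows the closing quote on a synonym line.
RELATED_RE = re.compile(r'^(synonym: ".*") RELATED\b', re.MULTILINE)


def fix_synonyms(obo_text: str) -> str:
    """Make synonyms EXACT, then tag all-caps synonyms as ABBREVIATION."""
    text = RELATED_RE.sub(r"\g<1> EXACT", obo_text)
    text = ABBREV_RE.sub(r"\g<1>ABBREVIATION [", text)
    return text


# Placeholder stanza; the ids are made up for illustration.
sample = """[Term]
id: MONDO:9999999
name: example disease
synonym: "EXD" RELATED [OMIM:000000]
synonym: "example disease, type 1" RELATED [OMIM:000000]
"""

print(fix_synonyms(sample))
```

Only the first synonym is tagged as an abbreviation, because the pattern requires the quoted text to consist entirely of capital letters and digits.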