This may be out of scope, but I wanted to raise it for discussion.
Frequently we want to replace instances of a token, word, or phrase in lexical ontology elements such as labels and definitions.
E.g.:
https://wiki.geneontology.org/Ontology_meeting_2024-04-08#Bulk_update_specific_keywords
The main concern is that this drifts into NLP territory. How are the boundaries of words or tokens defined? Do we need different tokenizers for chemical names vs. biological language? What about bespoke rules that exclude certain tokens from replacement in some contexts?
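To make the boundary question concrete, here is a small sketch comparing two possible definitions of "a word": regex word boundaries vs. whitespace-delimited tokens. The function names and the example string are hypothetical and only for illustration; neither is proposed as the right tokenizer.

```python
import re


def replace_word(text: str, old: str, new: str) -> str:
    """Replace `old` wherever it sits between regex word boundaries (\\b)."""
    # A hyphen counts as a boundary, so "amino-acid" contains the word "acid".
    pattern = rf"\b{re.escape(old)}\b"
    # Use a function as the replacement so `new` is inserted literally.
    return re.sub(pattern, lambda m: new, text)


def replace_token(text: str, old: str, new: str) -> str:
    """Replace `old` only when it is a whole whitespace-delimited token."""
    # Note: this normalizes runs of whitespace to single spaces.
    return " ".join(new if tok == old else tok for tok in text.split())


text = "amino-acid transport and acid secretion"
print(replace_word(text, "acid", "base"))   # amino-base transport and base secretion
print(replace_token(text, "acid", "base"))  # amino-acid transport and base secretion
```

The two results differ on the hyphenated term, which is exactly the kind of case where chemical names and general biological language might need different rules.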
This may be better handled by a separate tool that generates KGCL rename and redefine commands given an ontology plus some replacement rules.
In fact, the logic is very similar to that used for the synonymizer.
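A minimal sketch of what such a separate tool might look like, assuming the ontology is reduced to plain dicts of CURIE to label and CURIE to definition, and the rules are simple (old, new) word pairs. Everything here is hypothetical: the function names, the `EX:` identifiers, and the exact command strings, which only approximate KGCL's `rename` and `change definition of` syntax and should be checked against the KGCL grammar. Quoting and escaping of the text values is not handled.

```python
import re
from typing import Dict, Iterator, List, Tuple

Rule = Tuple[str, str]  # (old word or phrase, replacement); hypothetical rule format


def apply_rules(text: str, rules: List[Rule]) -> str:
    """Apply each rule in order, matching on regex word boundaries."""
    for old, new in rules:
        pattern = rf"\b{re.escape(old)}\b"
        # Bind `new` as a default argument so it is inserted literally.
        text = re.sub(pattern, lambda m, new=new: new, text)
    return text


def generate_commands(
    labels: Dict[str, str],
    definitions: Dict[str, str],
    rules: List[Rule],
) -> Iterator[str]:
    """Yield KGCL-like commands for every label or definition a rule changes."""
    for curie, label in labels.items():
        new_label = apply_rules(label, rules)
        if new_label != label:
            # Assumed to approximate the KGCL rename syntax.
            yield f"rename {curie} from '{label}' to '{new_label}'"
    for curie, definition in definitions.items():
        new_definition = apply_rules(definition, rules)
        if new_definition != definition:
            # Assumed to approximate the KGCL definition-change syntax.
            yield (
                f"change definition of {curie} "
                f"from '{definition}' to '{new_definition}'"
            )


labels = {"EX:1": "foo binding", "EX:2": "foobar binding"}
definitions = {"EX:1": "Binding to a foo.", "EX:2": "Binding to a foobar."}
for command in generate_commands(labels, definitions, [("foo", "bar")]):
    print(command)
```

Emitting commands rather than editing the ontology directly keeps the replacement logic reviewable: the generated change list can be inspected, filtered for context-specific exclusions, and then applied by existing KGCL tooling.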