Please find the slides from the meeting below: {{ :projects:workgroups:nlp_wg_meeting_20181114.pdf |}} Please find the recording of the meeting below: {{ :projects:workgroups:2018-11-14_14.00_ohdsi-nlp-wg.mp4 |}} Action items updates: * Rules for defining term_exists - led by Stephane Meystre - COMPLETED * Mapping of CUIs to standard terminology - led by Juan Banda - COMPLETED, here's the link to the repo: [[https://github.com/thepanacealab/OHDSIananke]] * Mapping of Note Types to LOINC/standard vocabulary - Karthik Natarajan, Ruth Reeves, Jon Duke and Hua Xu will work together on this: Multiple note types sent to Jon. To be followed up. * Landscape Analysis of section identifier systems and proposal of a standard terminology for use - led by Hua Xu with help from Karthik Natarajan: SecTagger and section header ontology obtained from Vanderbilt. Need to figure out normalization. * Examples and rules for term_temporal - led by George Hripsack (Sunny): The CDM to be updated with the definition below: * If in the past, then term_temporal is PAST. And term exists has to also be positive. You never have term_exists=no with PAST. * If not in the past, then term_temporal is blank. * If it is from the same inpatient admission, that is not considered past, but still in the present. * More discussion needed on whether to merge term_exists and term_temporal into one variable, or to leave them apart. * Standardization of term_modifiers and values - led by Hua Xu: To define modifier list and normalized values for each modifier were the tasks. Initial review were from SHARPn modifiers (also used in cTAKES) and Wendy Chapman's group ([[https://github.com/Blulab-Utah/resource_ontologies/wiki]]). * Should certainty be merged with negation? - in instances where the negation is not about certainty (existential/systemic), it would be better to keep them separate. Another point to think about is the utility of the categories (definite, probable, uncertain, probable negated, definite negated) and the ability of NLP systems to annotate the terms with them. Boolean negation is easier to annotate and handle. * Granularity of modifiers for subject - Family, Other, Patient may be sufficient for current datasets * Generic/conditional modifier - Generic and Conditional as modifier may be acceptable. cTAKES does not support this currently. * Course - possible values (very limited set) presented. Same as cTAKES. Default way of dealing with it could be to assign the ontology and the value from it. Open ended for now. Can be modified later based on use cases/usage. * Temporal - is pulled from cTAKES, could be improved. * Discovery technique - Name of NLPtool, in cTAKES currently, the methodology (gold-standard, rule-based, dictionary look-up etc.) used is the discovery technique. Utility? - needs to be discussed * Confidence score - how reliable * Disease specific modifiers - Body location, Severity (some values overlap with cTAKES) * Medication specific modifiers - need to maybe add strength? * Test specific - Change reference range to lower threshold, upper threshold? * Procedure specific modifiers - need examples before discussion, body location, also add laterality here. * term modifier was used to connect two tables in CDM that could not be otherwise obviously linked together. * include NLP in the Condition Type concept ID, or fact relationship table. * How has the NLP table information been used by the community?