Overview
The article recaps the LinkedIn NYC Tech Talk Series focused on Machine Learning and Data Science, featuring presentations from experts at LinkedIn, Cornell Medicine, and Google. It highlights innovative approaches to data extraction and analysis in various domains, including enterprise relationships and healthcare.
What You'll Learn
1
How to harvest company relationship data from news articles using machine learning techniques
2
Why analyzing Electronic Health Records (EHR) can improve patient treatment decisions
3
How to utilize the Perspective API to enhance online discussions
Prerequisites & Requirements
- Understanding of machine learning concepts and data extraction techniques
- Familiarity with Electronic Health Records and their applications in healthcare(optional)
Key Questions Answered
How can enterprise relationships be inferred from social networking data?
Enterprise relationships can be inferred by harvesting company relation data points from news articles, which involves curating a large training dataset and applying machine learning models to extract high-quality relationship data.
What are health trajectories and how are they built using EHR data?
Health trajectories are constructed by analyzing Electronic Health Records from millions of patients, incorporating multidimensional information and measuring similarity among patients' health changes to identify common patterns and optimize treatment.
What is the Perspective API and how does it improve online discussions?
The Perspective API is a tool developed by Google that uses machine learning to analyze comments and identify toxic language, helping to moderate discussions and improve the quality of online interactions.
Key Statistics & Figures
Number of attendees
145
The total number of outside attendees at the Machine Learning and Data Science meetup.
Technologies & Tools
API
Perspective API
Used to analyze and improve online discussions by identifying toxic language.
Key Actionable Insights
1Implementing machine learning models for data extraction can significantly enhance the quality of insights derived from large datasets.By automating the curation of training datasets, organizations can improve the accuracy and efficiency of their data analysis processes.
2Utilizing health trajectories can lead to better patient outcomes by tailoring treatment plans based on historical health data.This approach allows healthcare providers to make informed decisions that consider both medical history and social determinants of health.
3The Perspective API can be a valuable tool for organizations looking to foster healthier online communities.By identifying and mitigating toxic comments, platforms can create a more welcoming environment for users to engage in discussions.
Common Pitfalls
1
Failing to curate a diverse training dataset can lead to biased machine learning models.
When training data is not representative of all scenarios, models may misclassify certain inputs, particularly those involving underrepresented identities.
Related Concepts
Machine Learning
Data Science
Electronic Health Records
Natural Language Processing