Our lab is dedicated to research on NLP-driven techniques for science communication. We develop language technologies that support convenient, accurate, and equitable information services for scientists and the public.
Current Projects
- PreCheck: Understanding Press Release Exaggeration of Scientific Research
The PreCheck project aims to detect three common types of exaggerated claims in health-related press releases: causal claims drawn from correlational findings, extrapolation of animal study results to humans, and unwarranted health advice.
- News2References: Link News to Research Literature
The News2Reference project aims to accurately link science news articles to the original research publications. These links are often missing from current news articles; they can be used to measure the social impact of a research work or to detect inaccuracies and distortions in science news.
- CORA (Citation Opinion Retrieval and Analysis)
The CORA project aims to develop an automated tool that can plug into a full-text bibliographic database such as PubMed, extract the citation statements about a cited article, separate substantial citations from perfunctory ones, and categorize substantial citation opinions by purpose (e.g., comparison, critique), subject (e.g., methods, results), and tone (positive, negative, or neutral). This tool will help readers find the most useful comments among large numbers of citations and will facilitate downstream applications such as literature review, citation bias detection, and research impact assessment.
Recent Publications
- Detecting Health Advice in Medical Research Literature
- News2PubMed: A Browser Extension for Linking Health News to Medical Literature
- Linking Health News to Research Literature
- Self Promotion in US Congressional Tweets
- Analyzing Preservice Teachers’ Reflection Journals Using Text-mining Techniques
Chen Y, Yu B & Yu Y (2021)
International Journal of Innovation in Education
- Measuring Correlation-to-Causation Exaggeration in Press Releases
- Information Quality of Reddit Link Posts on Health News
Zhou H & Yu B (2020)
The 2020 iConference, 186-197
- Interventions to support consumer evaluation of online health information credibility: A scoping review
Song S, Zhang Y & Yu B (2020)
International Journal of Medical Informatics, 145(104321)
- The 7 Ps marketing mix of home-sharing services: Mining travelers’ online reviews on Airbnb
Kwok L, Tang Y & Yu B (2020)
International Journal of Hospitality Management, 90(102616)
- Detecting Causal Language Use in Science Findings
Yu B, Li Y & Wang J (2019)
EMNLP’2019, 4656-4666
- Identifying finding sentences in conclusion subsections of biomedical abstracts
Li Y & Yu B (2019)
The 2019 iConference, 679-689
- HClaimE: A tool to identify health claims in health news headlines
Yuan S & Yu B (2019)
Information Processing and Management, 56(4), 1220-1233
- Toward training and assessing reproducible data analysis in data science education
Yu B & Hu X (2019)
Data Intelligence, 1(4), 381-392
Web applications
- News2PubMed: Linking Health News to Medical Literature
This Chrome extension allows the reader of a health news article to quickly retrieve related medical/health research papers.
- CORA: Understand PubMed Citation Contexts
This Chrome extension aims to help people better understand the citation contexts of a PubMed article.
Team Members
Syracuse Faculty and Staff
Bei Yu
Professor
320 Hinds Hall
byu@syr.edu
Additional Staff
- Yingya Li
PhD Candidate
School of Information Studies
Syracuse University
- Jun Wang
Research Scientist
Syracuse, NY