News featuring Brian Mayer

Data scientists combat hate crimes and other violence

Research associates Brian Mayer (top) and Nathan Self (bottom) meet virtually to review targeted violence events on the dashboard developed by the Sanghani Center.

About the series: Every complex problem has many multidisciplinary angles. Leveraging expertise and energy, Virginia Tech faculty and students serve humanity by addressing the world’s most difficult problems.

With the risk of political and targeted violence on the rise across the United States, national and local leaders are asking Princeton University’s nonpartisan Bridging Divides Initiative (BDI) to provide them with more timely, reliable, and context-specific data on targeted violence events that could help them engage locally and better inform their policy decisions.

As part of their response to this plea, BDI’s team of Princeton social scientists collaborated with data scientists at the Sanghani Center for Artificial Intelligence and Data Analytics to identify targeted violence events. These often include hate crimes and other incidents that target individuals because of their race, religion, sexual orientation, or other perceived characteristics. Click here to read more about this research.


Research award aims to develop new algorithms for information extraction and understanding from scholarly literature

Naren Ramakrishnan, Director of DAC and Professor in the Department of Computer Science

The Discovery Analytics Center has received a research award from the Center for Security and Emerging Technology (CSET) at Georgetown University to support data-informed analysis for policymakers concerning emerging technologies and their security implications. DAC will develop methods to extract novel insights at scale from full-text analytics of publications to better understand emerging technologies and their prevalence, spatial and temporal trends, and relationships.

“Algorithmic components developed by DAC will go into a high-performance pipeline that enables inspection of extracted patterns as well as the lineage of data transformations underlying the patterns,” said Naren Ramakrishnan, the Thomas L. Phillips Professor of Engineering and DAC director, who is the principal investigator for the project.

Ramakrishnan’s team at DAC — which includes senior research associate Patrick Butler; research associate Brian Mayer; and three Ph.D. students — will develop a machine learning framework based on weak supervision to process full-text AI publications into extracted structured fields, such as information on computational platforms utilized, language and library dependencies, compute time, research methods, objective tasks, and links to source code and data resources.
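To make the idea of weak supervision concrete, here is a minimal sketch in Python. It is an illustration only, not DAC's framework: the labeling functions, the label names ("platform", "library", "code_link"), and the example URL are all hypothetical. Each heuristic either votes for a label or abstains, and a simple majority vote produces a noisy training label without any hand annotation.

```python
import re
from collections import Counter

ABSTAIN = None  # a labeling function returns this when it has no opinion

# Hypothetical labeling functions: cheap heuristics that each vote on
# what kind of computational detail a sentence describes.
def lf_hardware(sentence):
    # Hardware mentions suggest a "platform" sentence.
    return "platform" if re.search(r"\b(GPU|TPU|CPU)s?\b", sentence) else ABSTAIN

def lf_library(sentence):
    # A small, assumed list of library names; a real list would be longer.
    pattern = r"\b(PyTorch|TensorFlow|scikit-learn)\b"
    return "library" if re.search(pattern, sentence, re.I) else ABSTAIN

def lf_code_link(sentence):
    # Links to a code host suggest a source-code reference.
    return "code_link" if re.search(r"https?://(www\.)?github\.com/\S+", sentence) else ABSTAIN

def lf_trained_on(sentence):
    # "trained on ... GPUs" is a second, overlapping signal for "platform".
    return "platform" if re.search(r"\btrained on\b.*\b(GPU|TPU)s?\b", sentence) else ABSTAIN

LABELING_FUNCTIONS = [lf_hardware, lf_library, lf_code_link, lf_trained_on]

def weak_label(sentence):
    """Combine the noisy votes by simple majority; abstain if nobody votes."""
    votes = [lf(sentence) for lf in LABELING_FUNCTIONS]
    votes = [v for v in votes if v is not ABSTAIN]
    if not votes:
        return ABSTAIN
    return Counter(votes).most_common(1)[0][0]

if __name__ == "__main__":
    for s in [
        "We implemented the model in PyTorch and trained on 8 GPUs.",
        "Code is available at https://github.com/example/repo.",
        "We thank the reviewers.",
    ]:
        print(weak_label(s), "<-", s)
```

Majority voting is the simplest way to combine such heuristics. Weak supervision systems often replace it with a model that estimates how reliable each heuristic is, and the resulting labels are then used to train an extraction model.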

The initial focus will be on arXiv as researchers evaluate and assess progress, followed by extraction from China National Knowledge Infrastructure (CNKI) literature, which provides full-text articles from more than 8,000 Chinese journals covering natural sciences, engineering, technology, agriculture, medicine, and selected topics in economics and social sciences.

This project is providing DAC with the opportunity to build on its prior work in extracting information from news articles about civil unrest events. It will also be informed by DAC’s experience with automated extraction of epidemiological line lists from disease reports, which is used to develop custom word embeddings aimed at recognizing the typical language patterns in how computational details are described in the scholarly literature.

“This project brings together machine learning, computational linguistics, and human-computer interaction capabilities to extract features at scale. The information we extract will be mapped over time to help identify key trends and potential gaps that can support analysts and policy makers at the CSET,” said Ramakrishnan.

“We are looking forward to seeing how this innovative work can help inform CSET’s analysis as we strive to inform the future of AI policy,” said Dewey Murdick, director of Data Science at CSET.


DAC and UrbComp actively participating at KDD 2018 with conference organization and research presentations


The Discovery Analytics Center and the Urban Computing Certificate Program (funded through a National Science Foundation traineeship grant and administered through DAC) will be well represented at the 24th annual Association for Computing Machinery Special Interest Group on Knowledge Discovery and Data Mining (KDD 2018) conference in London, August 19-23.

The overall theme of this year’s conference is data mining for social good.

Chandan Reddy, associate professor of computer science and DAC faculty, served as a poster co-chair for the KDD conference.

Naren Ramakrishnan, the Thomas L. Phillips Professor of Engineering and DAC director, served on the senior program committee for the KDD research track.

Aditya Prakash, assistant professor of computer science and DAC faculty, served on the committee for Health Day at KDD, held in conjunction with the conference, and is one of four organizers for epiDAMIK: Epidemiology meets Data Mining and Knowledge discovery, a Health Day workshop.

This workshop serves as a forum to discuss new insights into how data mining can play a bigger role in epidemiology and public health research. While the integration of data science methods into epidemiology has significant potential, it remains understudied, Prakash said.

The goal of the workshop is to raise the profile of this emerging research area of data-driven and computational epidemiology and create a venue for presenting state-of-the-art and in-progress results — in particular, results that would otherwise be difficult to present at a major data mining conference, including lessons learned in the “trenches.”

The paper “Forecasting the Flu: Designing Social Network Sensors for Epidemics” (B. Aditya Prakash; Naren Ramakrishnan; Huijuan Shao, K.S.M. Tozammel Hossain, and Hao Wu, all DAC Ph.D. alumni; Madhav Marathe, professor of computer science and director of the Network Dynamics and Simulation Science Lab (NDSSL) at Virginia Tech; Anil Vullikanti, associate professor of computer science at NDSSL; and Maleq Khan, assistant professor at Texas A&M University) will be presented at the epiDAMIK workshop by Prakash and Vullikanti.

An Urban Computing workshop is also scheduled in conjunction with KDD 2018. The objective of this workshop is to provide professionals, researchers, and technologists with a single forum where they can discuss and share the state of the art in the development and applications of urban computing, present their ideas and contributions, and set future directions for innovative research in urban computing. It is particularly targeted at people who are interested in sensing, mining, and understanding urban data so as to tackle challenges in cities and help better formulate the future of cities.

The following posters from DAC have been accepted for presentation at the workshop:

Additionally, a DAC alumnus, Prithwish Chakraborty, is running a third workshop taking place during the conference, Machine Learning for Medicine and Healthcare (MLMH).


Virginia Tech graduate students team up with D.C. transit to help enhance customer service

UrbComp students Bryse Flowers (left) and Farnaz Khaghani were on the student team working with WMATA. Behind them is Brian Mayer, project manager and research scientist at the Discovery Analytics Center, who oversaw the study.

Last fall, the Washington Metropolitan Area Transit Authority (WMATA) struck a partnership with Virginia Tech’s graduate program in urban computing for help in predicting its system’s on-time performance (OTP).

The resulting study, by a team of students enrolled in Introduction to Urban Computing, a computer science course in the UrbComp certificate program administered by the Discovery Analytics Center, is one of the first steps in connecting WMATA’s Rush Hour Promise (initiated in January 2018 to provide a refund to any customer delayed by 15 minutes or more during rush hour) to underlying service disruptions, according to Jordan Holt, senior performance analyst at WMATA. Click here to read more about the collaboration.


DAC and BI lead DARPA’s Next Generation Social Science Project


Brian Goode (left), from the Discovery Analytics Center, and Chris Kuhlman, from the Biocomplexity Institute at Virginia Tech, collaborate on developing models for large-scale social behavior.

DAC and the Biocomplexity Institute are leading a $3 million grant awarded by the Defense Advanced Research Projects Agency (DARPA) as part of the Next Generation Social Science (NGS2) program. DAC and BI will conduct research that will streamline modeling processes, experimental design, and methodology in the social sciences. A major objective of the project is to make social science experiments rigorous, reproducible, and scalable to large populations.