Lu Wang is an Assistant Professor of College of Computer and Information Science at Northeastern University since 2015. She is also affiliated with NULab for Texts, Maps, and Networks. She completed her Ph.D. in the Department of Computer Science at Cornell University, under supervision of Professor Claire Cardie in 2015. She earned her bachelor degrees in Machine Intelligence and Economics from Peking University in 2009.

Lu's research is focused on natural language processing, computational social science, and machine learning. More specifically, Lu works on algorithms for abstractive text summarization, language generation, argumentation minining, and dialog analysis, as well as novel applications that apply such techniques to computational social science and other interdisciplinary subjects.


* For prospective Ph.D. students, please read this note.

* For undergrads who are interested in doing research with me, please read this note.

* I'm teaching CS6140 Machine Learning in spring 2017. Lecture notes and updates can be found on the course website.

* I'm co-organizing Workshop on New Frontiers in Summarization at EMNLP 2017.







Publications

Winning on the Merits: The Joint Effects of Content and Style on Debate Outcomes
Lu Wang, Nick Beauchamp, Sarah Shugars, and Kechen Qin.
Transactions of the Association for Computational Linguistics (TACL), 2017.
[pdf] [bibtex] [data]

Neural Network-Based Abstract Generation for Opinions and Arguments
Lu Wang and Wang Ling.
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2016.
[pdf] [bibtex] [data]

Summarization and Sentiment Analysis for Understanding Socially-Generated Content
Lu Wang.
Ph.D. Thesis, Cornell University, February 2016.
[pdf] [bibtex]

Socially-Informed Timeline Generation for Complex Events
Lu Wang, Claire Cardie, and Galen Marchetti.
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2015.
[pdf] [bibtex] [slides] [data]

Query-Focused Opinion Summarization for User-Generated Content
Lu Wang, Hema Raghavan, Claire Cardie, and Vittorio Castelli.
Proceedings of the 25th International Conference on Computational Linguistics (COLING), 2014.
[pdf] [bibtex] [slides]

A Piece of My Mind: A Sentiment Analysis Approach for Online Dispute Detection
Lu Wang and Claire Cardie.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL), short paper, 2014.
[pdf] [bibtex] [slides] [data]

Improving Agreement and Disagreement Identification in Online Discussions with A Socially-Tuned Sentiment Lexicon
Lu Wang and Claire Cardie.
Proceedings of the ACL Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2014.
[pdf] [bibtex] [slides]

Leveraging Semantic Web Search and Browse Sessions for Multi-Turn Spoken Dialog Systems
Lu Wang, Larry Heck, and Dilek Hakkani-Tur.
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014.
[pdf] [bibtex] [poster]
One of the Two Award Papers of Spoken Language Processing Student Travel Award [link]

A Sentence Compression Based Framework to Query-Focused Multi-Document Summarization
Lu Wang, Hema Raghavan, Vittorio Castelli, Radu Florian, and Claire Cardie.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL), 2013.
[pdf] [bibtex] [slides]

Domain-Independent Abstract Generation for Focused Meeting Summarization
Lu Wang and Claire Cardie.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL), 2013.
[pdf] [bibtex] [slides]

Unsupervised Topic Modeling Approaches to Decision Summarization in Spoken Meetings
Lu Wang and Claire Cardie.
Proceedings of the Special Interest Group on Discourse and Dialogue (SIGDIAL), 2012.
[pdf] [bibtex] [slides]
Best Paper Nomination

Focused Meeting Summarization via Unsupervised Relation Extraction
Lu Wang and Claire Cardie.
Proceedings of the Special Interest Group on Discourse and Dialogue (SIGDIAL), 2012.
[pdf] [bibtex] [slides]

Summarizing Decisions in Spoken Meetings
Lu Wang and Claire Cardie.
Proceedings of the ACL Workshop on Automatic Summarization for Different Genres, Media, and Languages, 2011.
[pdf] [bibtex] [poster]

Non-Referred Publications

CornPittMich Sentiment Slot-Filling System at TAC 2014
Xilun Chen, Arzoo Katiyar, Xiaoan Yan, Lu Wang, Carmen Banea, Yoonjung Choi, Lingjia Deng, Claire Cardie, Rada Mihalcea, and Janyce Wiebe.
Proceedings of the TAC-KBP 2014 Workshop, 2014.
Won Second Place in Sentiment Slot-Filling Track

Cornell Expert Aided Query-focused Summarization (CEAQS): A Summarization Framework to PoliInformatics
Lu Wang, Parvaz Mahdabi, Joonsuk Park, Dinesh Puranam, Bishan Yang, and Claire Cardie.
NLP Unshared Task in PoliInformatics 2014.
[pdf] [bibtex] [data] [website]



Teaching

CS 6140 - Machine Learning. Northeastern University, Spring 2017. [course webpage]
CS 6140 - Machine Learning. Northeastern University, Spring 2016. [course webpage]
CS 7180 - Special Topics in Artificial Intelligence. Northeastern University, Fall 2015. [course webpage]



Grants

National Science Foundation CRII. "Towards Abstractive Summarization of Meetings", 2016 - 2018, PI.
Northeastern University Tier 1. "Dynamic Heterogeneous Information Networks for Intelligent Visual Forecasting", 2016 - 2017, PI with Yun Raymond Fu.
Northeastern University Tier 1. "Inferring Argument Structure from Online and Live Conversations", 2017 - 2018, PI with Nick Beauchamp and Michelle Borkin.
NVIDIA GPU Grant. 2016, PI.



Professional Activities

Area Co-chair: ACL 2016 (Summarization)
Workshop Co-organizer: Workshop on New Frontiers in Summarization at EMNLP 2017.
Member, Program Committee: ACL (2015, 2014, 2013), NAACL 2015, EMNLP (2016, 2015, 2014), COLING 2016, AISTATS 2017, AAAI 2016, IJCAI 2016, ICWSM (2017, 2014), WWW 2014
Reviewer: Journal of Artificial Intelligence Research, Speech Communication, IEEE Transactions on Knowledge and Data Engineering, Transactions on Audio, Speech and Language Processing



Data

Movie Review and Online Argument Corpus (First Released, 2016)

- Movie critics and their consensus from Rotten Tomatoes.
- Online arguments from idebate.org.

* DATA (.zip)
* README (.txt)
* This corpus is distributed together with:
   Neural Network-Based Abstract Generation for Opinions and Arguments
   Lu Wang and Wang Ling.
   Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2016.

Socially-Informed Timeline Generation Corpus (First Release, 2015)

- New York Times, CNN, and BBC news articles and user comments on four major events happened in 2014.
- New York Times news articles and user comments in 2013.

* DATA (.zip)
* README (.txt)
* This corpus is distributed together with:
   Socially-Informed Timeline Generation for Complex Events
   Lu Wang, Claire Cardie, and Galen Marchetti.
   Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2015.

Wikipedia Disputed Discussion Corpus (First Release, 2016)

- Dispute and non-dispute discussions from Wikipedia talkpages.

* DATA (.zip)
* README (.txt)
* This corpus is distributed together with:
   A Piece of My Mind: A Sentiment Analysis Approach for Online Dispute Detection
   Lu Wang and Claire Cardie.
   Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL), short paper, 2014.