Announcements

Academic Position

  • Present 2015

    Assistant Professor

    Northeastern University, College of Computer and Information Science

Education

  • Ph.D. 2015

    Ph.D., Computer Science

    Cornell University

  • B.Sc.2009

    Intelligence Science and Technology

    Peking University

  • B.Econ.2009

    Economics

    Peking University

Grants

  • 2017–2021
    Intelligence Advanced Research Projects Activity
    "Machine Translation for English Retrieval of Information in Any Language (MATERIAL)", subcontract to USC ISI.
  • 2016–2018
    National Science Foundation CRII
    "Towards Abstractive Summarization of Meetings", PI.
  • 2017–2018
    Northeastern University Tier 1
    "Inferring Argument Structure from Online and Live Conversations", PI with Nick Beauchamp and Michelle Borkin.
  • 2017–2018
    Gift from Tencent AI Lab
  • 2017–2018
    Gift from Toutiao AI Lab
  • 2016–2017
    Northeastern University Tier 1
    "Dynamic Heterogeneous Information Networks for Intelligent Visual Forecasting", PI with Yun Raymond Fu.
  • 2016, 2017
    NVIDIA GPU Grants
  • image

    Xinyu Hua Site

    Ph.D. student (2016–Present)

    ACL outstanding short paper award, 2017

    Argument mining (ACL'17)

    Neural abstractive summarization (EMNLP NewSum'17)

  • image

    Lisa Fan Site

    Ph.D. student (2017–Present)

    International Conference on Intelligent Virtual Agents (IVA) best paper nomination, 2017

    Human robotics interaction (IVA'17)

Filter by type:

Sort by year:

Neural Argument Generation Augmented with Externally Retrieved Evidence

Xinyu Hua and Lu Wang
Conference Paper Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL), 2018.

Microblog Conversation Recommendation via Joint Modeling of Topics and Discourse

Xingshan Zeng, Jing Li, Lu Wang, Nick Beauchamp, Sarah Shugars, and Kam-Fai Wong
Conference Paper Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2018.

Joint Modeling of Content and Discourse Relations in Dialogues

Kechen Qin, Lu Wang, and Joseph Kim
Conference Paper Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), 2017.

Understanding and Detecting Supporting Arguments of Diverse Types

Xinyu Hua and Lu Wang
Conference Paper Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), short paper, 2017. ACL Outstanding Paper Award.

Winning on the Merits: The Joint Effects of Content and Style on Debate Outcomes

Lu Wang, Nick Beauchamp, Sarah Shugars, and Kechen Qin
Journal Paper Transactions of the Association for Computational Linguistics (TACL), 2017.

A Pilot Study of Domain Adaptation Effect for Neural Abstractive Summarization

Xinyu Hua and Lu Wang
Workshop Paper Proceedings of the EMNLP Workshop on New Frontiers in Summarization, 2017.

Weakly-Guided User Stance Prediction via Joint Modeling of Content and Social Interaction

Rui Dong, Yizhou Sun, Lu Wang, Yupeng Gu, and Yuan Zhong
Conference Paper Proceedings of International Conference on Information and Knowledge Management (CIKM), 2017.

Neural Network-Based Abstract Generation for Opinions and Arguments

Lu Wang and Wang Ling
Conference Paper Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2016.

Summarization and Sentiment Analysis for Understanding Socially-Generated Content

Lu Wang
Thesis Ph.D. Thesis, Cornell University, February 2016.

Socially-Informed Timeline Generation for Complex Events

Lu Wang, Claire Cardie, and Galen Marchetti
Conference Paper Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2015.

Query-Focused Opinion Summarization for User-Generated Content

Lu Wang, Hema Raghavan, Claire Cardie, and Vittorio Castelli
Conference Paper Proceedings of the 25th International Conference on Computational Linguistics (COLING), 2014.

A Piece of My Mind: A Sentiment Analysis Approach for Online Dispute Detection

Lu Wang and Claire Cardie
Conference Paper Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL), short paper, 2014.

Improving Agreement and Disagreement Identification in Online Discussions with A Socially-Tuned Sentiment Lexicon

Lu Wang and Claire Cardie
Workshop Paper Proceedings of the ACL Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2014.

Leveraging Semantic Web Search and Browse Sessions for Multi-Turn Spoken Dialog Systems

Lu Wang, Larry Heck, and Dilek Hakkani-Tur
Conference Paper Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014. One of the Two Award Papers of Spoken Language Processing Student Travel Award. [link]

CornPittMich Sentiment Slot-Filling System at TAC 2014

Xilun Chen, Arzoo Katiyar, Xiaoan Yan, Lu Wang, Carmen Banea, Yoonjung Choi, Lingjia Deng, Claire Cardie, Rada Mihalcea, and Janyce Wiebe
Non-refereed Publication Proceedings of the TAC-KBP 2014 Workshop, 2014. Won Second Place in Sentiment Slot-Filling Track.

Cornell Expert Aided Query-focused Summarization (CEAQS): A Summarization Framework to PoliInformatics

Lu Wang, Parvaz Mahdabi, Joonsuk Park, Dinesh Puranam, Bishan Yang, and Claire Cardie
Non-refereed Publication NLP Unshared Task in PoliInformatics 2014.

A Sentence Compression Based Framework to Query-Focused Multi-Document Summarization

Lu Wang, Hema Raghavan, Vittorio Castelli, Radu Florian, and Claire Cardie
Conference Paper Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL), 2013.

Domain-Independent Abstract Generation for Focused Meeting Summarization

Lu Wang and Claire Cardie
Conference Paper Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL), 2013.

Unsupervised Topic Modeling Approaches to Decision Summarization in Spoken Meetings

Lu Wang and Claire Cardie
Conference Paper Proceedings of the Special Interest Group on Discourse and Dialogue (SIGDIAL), 2012. Best Paper Nomination.

Focused Meeting Summarization via Unsupervised Relation Extraction

Lu Wang and Claire Cardie
Conference Paper Proceedings of the Special Interest Group on Discourse and Dialogue (SIGDIAL), 2012.

Summarizing Decisions in Spoken Meetings

Lu Wang and Claire Cardie
Workshop Paper Proceedings of the ACL Workshop on Automatic Summarization for Different Genres, Media, and Languages, 2011.

Microblog Conversation Recommendation Corpus (First Release, 2018)

Movie Review and Online Argument Corpus (First Release, 2016)

Socially-Informed Timeline Generation Corpus (First Release, 2015)

  • New York Times, CNN, and BBC news articles and user comments on four major events happened in 2014.
    New York Times news articles and user comments in 2013.
  •  DATA (.zip)
     README (.txt)
  • This corpus is distributed together with:
    Socially-Informed Timeline Generation for Complex Events
    Lu Wang, Claire Cardie, and Galen Marchetti
    Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2015.

Wikipedia Disputed Discussion Corpus (First Release, 2016)

Selected Professional Services

  • Area Co-chair: NAACL 2018 (Summarization), ACL 2016 (Summarization)
  • Senior Program Committee: AAAI 2018
  • Tutorial Co-chair: EMNLP 2018
  • Workshop Co-organizer: Workshop on New Frontiers in Summarization at EMNLP 2017
  • Member, Program Committee:

    - ACL (2018, 2017, 2015, 2014, 2013)

    - NAACL 2015

    - EMNLP (2018, 2017, 2016, 2015, 2014)

    - ICML 2018

    - NIPS 2018

    - COLING 2016

    - AISTATS 2017

    - AAAI (2018, 2017, 2016)

    - IJCAI 2016

    - ICWSM (2018, 2017, 2014)

    - WWW 2014

  • Reviewer:

    - Transactions of the Association for Computational Linguistics (TACL)

    - Journal of Computational Linguistics (CL)

    - Journal of Artificial Intelligence Research (JAIR)

    - Communications of the ACM (CACM)

    - Transactions on Information Systems (TOIS)

    - Speech Communication

    - IEEE Transactions on Knowledge and Data Engineering (TKDE)

    - Transactions on Audio, Speech and Language Processing

  • Funding Agency Panelist: National Science Foundation (2017 for CISE and SBE, 2016 for CISE)

Current Course

  • Spring 2018

    CS 6120/4120 - Natural Language Processing. Northeastern University.

    Course Webpage

Past Courses

  • Fall 2017

    CS 6120/4120 - Natural Language Processing. Northeastern University.

    Course Webpage

  • Spring 2017

    CS 6140 - Machine Learning. Northeastern University.

    Course Webpage

  • Sprint 2016

    CS 6140 - Machine Learning. Northeastern University.

    Course Webpage

  • Fall 2015

    CS 7180 - Special Topics in Artifical Intelligence. Northeastern University.

    Course Webpage

To Prospective Ph.D. Students, Post-docs, and Visitors

I'm interested in doing research in natural language processing (NLP), where I design machine learning algorithms and computational models (e.g., neural networks, stochastic algorithms, probabilistic models, etc) for tasks on abstractive text summarization, language generation, argumentation mining, discourse analysis, and dialog analysis. I'm also interested in applying NLP and machine learning techniques for interdisciplinary subjects, e.g. computational social science. Specifically, there are four major ongoing research projects:

  • Abstractive Text Summarization: We tackle the challenge of extracting key information from large amounts of textual data. We aim to generate concise and informative summaries for different types of texts, ranging from news articles in traditional media, to socially-generated content in popular social media (e.g. comments, tweets, or blogs), and to government meetings (e.g. Federal Reserve board meetings). Neural network-based methods are developed, with traditional training methods, multi-task learning, or reinforcement learning-based methods.
  • Argumentation Mining: Arguments play an important role for decision-making processes and persuasion. We are interested in understanding how people argue with and influence others, as well as form their own opinions on topics of interest. Especially, we want to discover linguistic patterns that reflect these processes, and use them for social interaction analysis and prediction. NLP models for discourse and semantic analysis are investigated for this project.
  • Argument Generation: In this project we aim to automate the process of generating persuasive arguments for different targeting audience or based on user's information request. This would need a better understanding of many aspects of high quality arguments, including discourse, semantic, and linguistic style. There is a wide range of applications, from improving essay writing and online argumentation skills, to enhancing public deliberation and political debate quality.
  • Information Extraction: We work on semantic understanding of text from different domains, including both news articles and user comments. Concretely, our goal is to advance the state-of-the-art methods on mention detection, entity extraction, and entity linking tasks.

If any of these sounds interesting to you, please check out more about my research and papers at this website. My students and I are also part of Northeastern NLP group. If you find your research agenda might be aligned with mine and are interested in working with me, please fill in this external contact form and (for Ph.D. applicants) apply to College of Computer and Information Science (CCIS) at Northeastern University and mention my name in your application.

To Undergrad Students Interested in NLP Research

Please feel free to reach out using this external contact form . Prerequisites include being able to write code in some programming languages (e.g. Python, Java, C/C++) proficiently, and finishing courses in algorithms, multivariable calculus, probability, statistics, and linear algebra. It would be better that you have taken a course in natural language processing (CS4120/CS6120) or machine learning (CS4140/CS6140).