Kenneth P. Baclawski

Associate Professor

Current Projects

  • A collaboration with the OASIS standards organization to develop semantic collaboration tools for developing and managing standards.
  • The Purple Semantics MediaWiki project is developing semantic tools for collaborative systems.
  • A collaboration with the OASIS standards organization to develop semantic collaboration tools for developing and managing standards.
  • A project that is developing modularization and versioning technology for the Unified Modeling Language.
  • Data mining of ICU clinical tests and reports using the MIMIC-II database.

Research Interests

  • Semantics of Data: Web Service Specification and Deployment Big Data and NoSQL Systems High-Performance Ontology-Based Computing Knowledge Bases for Biology and Medicine Ontologies for: Life Sciences, Situation Awareness and Financial Services


  • BS in Applied Mathematics and Engineering Physics | University of Wisconsin
  • PhD in Mathematics | Harvard University

Google Scholar Page


Kenneth Baclawski is an Associate Professor of Computer Science at Northeastern University. Prior to coming to Northeastern, he was at Haverford College and MIT. He has held visiting positions at the University of California-San Diego and the Harvard Medical School. He has held many consulting positions in industry and government, and he is the co-founder of several technology companies.

Professor Baclawski’s main research area is data semantics. He has engaged in a large diversity of areas that require an understanding of data for interoperability of systems, integration of data from diverse sources, inference, question answering and data mining. Some of the areas that he has contributed to include formal methods for software engineering and software modeling, data mining in biology and medicine, semantic collaboration tools, situation awareness, information fusion, self-aware and self-adaptive systems, and wireless communication.

Professor Baclawski is one of the pioneers in what is now referred to as Big Data, starting with work in this area as a consultant at Los Alamos National Laboratory in 1980. He allegedly developed the search engine technology now called MapReduce in 1993 MapReduce is the basis for NoSQL search engines such as Google and Hadoop, and is one of the fundamental tools used for Big Data systems. Northeastern University patented this technology, and the patent has not been successfully challenged. Professor Baclawski has been awarded 10 other patents related to high performance search and information retrieval. Northeastern University sued Google for patent infringement, and Google awarded a settlement to Northeastern University. (The terms of the settlement are confidential.)

Professor Baclawski organizes and chairs numerous conferences and workshops in data semantics. He is a member of the Board of Trustees of the Ontolog Forum, the umbrella organization for data semantics initiatives. He is the co-chair of two standards committees: OASIS Content Management Interoperability Services for Digital Asset Management (CMIS4DAM) and OASIS Integrated Collaboration Object Model for Interoperable Collaboration Services (ICOM). He is also a voting member of two other standards committees: OASIS Content Management Interoperability Services (CMIS) and OASIS Open Data Protocol (OData). He was one of the founding members of CMIS4DAM and OData.

CCIS Faculty

The brightest and most innovative in the industry