Founded in 1999, the Knowledge Discovery and Machine Learning research group (DCAM) of PPGIa/PUCPR is currently composed of eight professors. The group conducts theoretical and applied research in machine learning, big data analytics, natural language processing, information retrieval, and computer vision. A detailed description of the group’s scientific production is available here (in Portuguese).

The group is constantly recruiting graduate students (Master's and PhD) to work on research and development projects in partnership with public and private companies. Interested applicants should explore the team page for more information about the group's staff and research topics.

Research Areas

Machine Learning

This line of research aims to advance the state of the art in machine learning, a subfield of Artificial Intelligence that studies techniques that allow computers to learn from examples by induction and to apply the learned knowledge to new examples. Considering different real-world applications, data, and types of learning, including supervised, unsupervised, and semi-supervised learning, our research group focuses on classification, clustering, association, and regression tasks. Among the studied themes, the following stand out: generation, selection, and fusion of classifiers; stream learning; representation learning; and deep learning.

Big Data Analytics

This line of research encompasses theoretical and practical advances in different branches of data analysis across different Big Data scenarios and tools. These scenarios are characterized by massive amounts of potentially unstructured data that arrive over time and at high speed, which creates the need for specific algorithms and techniques to support social, market, and industry advances.

Natural Language Processing

In this line of research, our goal is to advance the state of the art in textual data processing, focusing on Brazilian Portuguese. This type of data is increasingly pervasive on the Internet (social media, recommendation websites, etc.) and in industry. The area offers many research gaps and opportunities, given the naturally ambiguous and noisy character of text.

Information Retrieval

In this line of research, we focus on developing and applying techniques for data access, management, and usage. We approach information retrieval from different perspectives, including the study of human-machine interactions, multimedia applications, and efficient data indexing and querying. Our aim is to better understand how people access, interact with, use, and re-use information.

Computer Vision and Deep Learning

In this line of research, we focus on developing novel computer vision and deep learning techniques for audio, image, and video understanding. We aim to develop new theories and applied models that advance audio, image, and video processing, feature extraction, transfer learning, and style transfer.

Partnership and Funding