Name:
Robson Cordeiro

Contact info: Carnegie Mellon University
School of Computer Science
5000 Forbes Avenue
Pittsburgh, PA 15213

Project Scientist
January 2024 - current
Email:
robsonc AT andrew DOT cmu DOT edu
Phone:
(510)519-0845




Book
Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Jr.
Data Mining in Large Sets of Complex Data
Springer Briefs in Computer Science
Springer 2013, ISBN 978-1-4471-4889-0, 116 pages. [Notable Book Award, by ACM]
Select Publications
  In reverse chronological order:
  1. Eugênio F. Cabral, Braulio Valentin Sanchez Vinces, Guilherme D. F. Silva, Jörg Sander, Robson L. F. Cordeiro. Efficient outlier detection in numerical and categorical data. Data Min. Knowl. Discov. 39(3): 18 (2025). Details     Download: [pdf] (5.5MB)
  2. Braulio Valentin Sanchez Vinces, Erich Schubert, Arthur Zimek, Robson L. F. Cordeiro. A comparative evaluation of clustering-based outlier detection. Data Min. Knowl. Discov. 39(2): 13 (2025). Details     Download: [pdf] (6.4MB)
  3. Leonardo Mauro Pereira Moraes, Felippe P. Ferreira, Robson L. F. Cordeiro. Modeling and analyzing Social Networks of Games. Expert Syst. Appl. 261: 125449 (2025). Details     Download: [pdf] (1.6MB)
  4. Braulio Valentin Sanchez Vinces, Robson L. F. Cordeiro, Christos Faloutsos. McCatch: Scalable Microcluster Detection in Dimensional and Nondimensional Datasets. In Proceedings of the 40th IEEE International Conference on Data Engineering, ICDE 2024, 1407-1420.
    Utrecht, The Netherlands, May 13-16, 2024. Details     Download: [pdf] (7.7MB)
  5. Daniela F. Milon-Flores, Robson L. F. Cordeiro. How to take advantage of behavioral features for the early detection of grooming in online conversations. Knowl. Based Syst. 240: 108017 (2022). Details     Download: [pdf] (15.5MB)
  6. Shuli Jiang, Robson L. F. Cordeiro, Leman Akoglu. D.MCA: Outlier Detection with Explicit Micro-Cluster Assignments. In Proceedings of the IEEE International Conference on Data Mining, ICDM 2022, 987-992.
    Orlando, FL, USA, November 28 - December 1, 2022. Details     Download: [pdf] (1.6MB)
  7. Matheus Aparecido do Carmo Alves, Robson L. F. Cordeiro. Effective and unburdensome forecast of highway traffic flow with adaptive computing. Knowl. Based Syst. 212: 106603 (2021). Details     Download: [pdf] (1.4MB)
  8. Jadson José Monteiro Oliveira, Robson Leonardo Ferreira Cordeiro. Unsupervised dimensionality reduction for very large datasets: Are we going to the right direction? Knowl. Based Syst. 196: 105777 (2020). Details     Download: [pdf] (6.1MB)
  9. Eugênio F. Cabral, Robson L. F. Cordeiro. Fast and Scalable Outlier Detection with Sorted Hypercubes. In Proceedings of the 29th ACM International Conference on Information and Knowledge Management - CIKM 2020, 95-104.
    Virtual Event, Ireland, October 19-23, 2020. Details     Download: [pdf] (877KB)
  10. Leonardo Mauro Pereira Moraes, Robson Leonardo Ferreira Cordeiro. Detecting Influencers in Very Large Social Networks of Games. In Proceedings of the 21st International Conference on Enterprise Information Systems - ICEIS 2019, 93-103.
    Heraklion, Crete, Greece, May 3-5, 2019. Details     Download: [pdf] (578KB) [Best Research Paper Award]
  11. André S. Gonzaga, Robson L. F. Cordeiro. The similarity-aware relational division database operator with case studies in agriculture and genetics. Inf. Syst. 82: 71-87 (2019). Details     Download: [pdf] (6MB)
  12. Gabriel P. Gimenes, Robson L. F. Cordeiro, José Fernando Rodrigues Jr. ORFEL: Efficient detection of defamation or illegitimate promotion in online recommendation. Inf. Sci. 379: 274-287 (2017). Details     Download: [pdf] (446KB)
  13. André S. Gonzaga, Robson Leonardo Ferreira Cordeiro. A New Division Operator to Handle Complex Objects in Very Large Relational Datasets. In Proceedings of the 20th International Conference on Extending Database Technology - EDBT 2017, 474-477.
    Venice, Italy, March 21-24, 2017. Download: [pdf] (4.4MB)
  14. Hugo Gualdron, Robson L. F. Cordeiro, José Fernando Rodrigues Jr., Duen Horng (Polo) Chau, Minsuk Kahng, U Kang. M-Flash: Fast Billion-Scale Graph Computation Using a Bimodal Block Processing Model. In Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases - ECML PKDD 2016, 623-640.
    Riva del Garda, Italy, September 19-23, 2016. Details     Download: [pdf] (403KB)
  15. Afonso Expedito Da Silva, Lucas L. Sanches, Antonio C. Fraideinberze, Robson L. F. Cordeiro. Halite_ds: Fast and Scalable Subspace Clustering for Multidimensional Data Streams. In Proceedings of the SIAM International Conference on Data Mining - SDM 2016, 351-359.
    Miami, Florida, USA, May 5-7, 2016. Details     Download: [pdf] (942KB)
  16. Ives Rene Venturini Pola, Robson L. F. Cordeiro, Caetano Traina Jr., Agma J. M. Traina. Similarity sets: A new concept of sets to seamlessly handle similarity in database management systems. Inf. Syst. 52: 130-148 (2015). Details     Download: [pdf] (2.2MB)
  17. Robson L. F. Cordeiro, Fan Guo, Donna S. Haverkamp, James H. Horne, Ellen K. Hughes, Gunhee Kim, Luciana A. S. Romani, Priscila P. Coltri, Tamires T. Souza, Agma J. M. Traina, Caetano Traina Jr., Christos Faloutsos. QuMinS: Fast and scalable querying, mining and summarizing multi-modal databases. Inf. Sci. 264: 211-229 (2014). Details     Download: [pdf] (16.5MB)
  18. Robson Leonardo Ferreira Cordeiro, Agma J. M. Traina, Christos Faloutsos, Caetano Traina Jr. Halite: Fast and Scalable Multiresolution Local-Correlation Clustering. IEEE Trans. Knowl. Data Eng. 25(2): 387-401 (2013). Details     Download: [pdf] (1.7MB)
  19. Ives Rene Venturini Pola, Robson Leonardo Ferreira Cordeiro, Caetano Traina Jr., Agma J. M. Traina. A New Concept of Sets to Handle Similarity in Databases: The SimSets. In Proceedings of the 6th International Conference on Similarity Search and Applications - SISAP 2013, 30-42.
    A Coruña, Spain, October 2-4, 2013. Details     Download: [pdf] (1.1MB) [Best Research Paper Award]
  20. Robson Leonardo Ferreira Cordeiro, Caetano Traina Jr., Agma Juci Machado Traina, Julio López, U Kang, Christos Faloutsos. Clustering very large multi-dimensional datasets with MapReduce. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD 2011, 690-698.
    San Diego, CA, USA, August 21-24, 2011. Details     Download: [pdf] (1.1MB)
  21. Robson Leonardo Ferreira Cordeiro, Agma J. M. Traina, Christos Faloutsos, Caetano Traina Jr. Finding Clusters in subspaces of very large, multi-dimensional datasets. In Proceedings of the 26th International Conference on Data Engineering - ICDE 2010, 625-636.
    Long Beach, California, USA, March 1-6, 2010. Details     Download: [pdf] (2MB)
  22. Robson Leonardo Ferreira Cordeiro, Fan Guo, Donna S. Haverkamp, James H. Horne, Ellen K. Hughes, Gunhee Kim, Agma J. M. Traina, Caetano Traina Jr., Christos Faloutsos. QMAS: Querying, Mining and Summarization of Multi-modal Databases. In Proceedings of the 10th IEEE International Conference on Data Mining - ICDM 2010, 785-790.
    Sydney, Australia, December 14-17, 2010. Details     Download: [pdf] (2.2MB)
Software
  1. The BoW package.
    (Click to read the BoW paper).
  2. The Halite package.
    (Click to read the Halite paper).
  3. The QuMinS package.
    (Click to read the QuMinS paper).
  4. The C-AllOut package.
    (Click to read the C-AllOut paper).
  5. The FReE package.
    (Click to read the FReE paper).
  6. The ORFEL package.
    (Click to read the ORFEL paper).
  7. The Curl-Remover package.
    (Click to read the Curl-Remover paper).
  8. The HySortOD package.
    (Click to read the HySortOD paper).
  9. The McCatch package.
    (Click to read the McCatch paper).
  10. The D.MCA package.
    (Click to read the D.MCA paper).
  11. The M-Flash package.
    (Click to read the M-Flash paper).
  12. The BF-PSR package.
    (Click to read the BF-PSR paper).
  13. The GInfluencer package.
    (Click to read the GInfluencer paper).
  14. The ULEARn package.
    (Click to read the ULEARn paper).
Awards
  In reverse chronological order:
  1. Award for teaching excellence
    Selected among the best professors of the Computer Engineering Undergraduate Program.
    Award received from the São Carlos School of Engineering of the University of São Paulo - USP.
    São Carlos, SP, Brazil, 2021.
  2. Award for teaching excellence
    Selected among the best professors of the Civil Engineering Undergraduate Program.
    Award received from the São Carlos School of Engineering of the University of São Paulo - USP.
    São Carlos, SP, Brazil, 2020.
  3. Best Research Paper Award
    Paper chosen as one of the best of the 21st International Conference on Enterprise Information Systems - ICEIS 2019. Award received from INSTICC.
    Paper: Detecting Influencers in Very Large Social Networks of Games.
    Heraklion, Crete, Greece, 2019.
  4. Best CS Thesis Award (advisor) [Main Award]
    MSc thesis ranked among the nine best Brazilian CS theses of 2017. Award received from the Brazilian Computer Society - SBC.
    Natal, RN, Brazil, 2018.
  5. Notable Computing Books and Articles Award [Main Award]
    Book chosen as one of the ACM Computing Reviews' "Notable Computing Books and Articles". Award received from ACM Computing Reviews.
    Book: Data Mining in Large Sets of Complex Data.
    New York, NY, USA, 2013.
  6. Best Research Paper Award
    Paper chosen as one of the best of the 6th International Conference on Similarity Search and Applications - SISAP 2013. Award received from the SISAP initiative.
    Paper: A New Concept of Sets to Handle Similarity in Databases: The SimSets.
    A Coruña, Spain, 2013.
  7. Best CS Dissertation Award [Main Award]
    Ranked first among the Brazilian CS PhD Dissertations of 2011. Award received from the Brazilian Computer Society - SBC.
    Curitiba, PR, Brazil, 2012.
  8. Best CS Dissertation Award
    Ranked first among the PhD Dissertations of the Graduate Program in Computer Science and Computational Mathematics defended in 2011. Award received from the University of São Paulo - USP.
    São Carlos, SP, Brazil, 2012.
Short Bio

Robson Cordeiro is a Project Scientist at Carnegie Mellon University and an Innovation Consultant at PNC Bank. He also worked as an Associate Professor at the University of São Paulo for several years. With over 10 years of experience in industry and academia, he is a data science innovator and educator who conducts cutting-edge research and teaches in the field of Computer Science (CS). Robson holds a PhD and a postdoctorate in CS, with a focus on Large-scale Machine Learning, Data Mining, Databases, and Information Retrieval. His PhD dissertation won the "Best CS Dissertation Award" from the Brazilian Computer Society and led to a book published by Springer, which was later chosen as one of the "Computing Reviews' Notable Computing Books and Articles" by ACM.

Robson is passionate about advancing the frontiers of knowledge and innovation in data science and sharing his expertise with the next generation of researchers and professionals. He has created and developed dozens of data science tools and algorithms that have been applied to various domains, such as finance, cyber-security, social networks, and health care. He has published over 40 scientific articles in world-class venues and co-authored a book that was recognized by ACM. He has also successfully advised and mentored 11 graduate students in their PhD or MSc projects and taught dozens of courses in both undergraduate and graduate CS programs. He is always looking for new challenges and opportunities to collaborate with other experts and organizations in data science.