Publications

Publications from members of the data mining group:

2024

Miriama Jánosová, Andreas Lang, Petra Budíková, Erich Schubert and Vlastislav Dohnal.
Advancing the PAM Algorithm to Semi-supervised k-Medoids Clustering
In: Proceedings of the 17th International Conference on Similarity Search and Applications (SISAP), Providence, RI, USA, 223-237, 2024.
[DOI: 10.1007/978-3-031-75823-2_19] | [BibTeX]
Erich Schubert.
Hierarchical Clustering Without Pairwise Distances by Incremental Similarity Search
In: Proceedings of the 17th International Conference on Similarity Search and Applications (SISAP), Providence, RI, USA, 238-252, 2024.
[DOI: 10.1007/978-3-031-75823-2_20] | [BibTeX]
Erik Thordsen and Erich Schubert.
Grouping Sketches to Index High-Dimensional Data in a Resource-Limited Setting
In: Proceedings of the 17th International Conference on Similarity Search and Applications (SISAP), Providence, RI, USA, 274-282, 2024.
[DOI: 10.1007/978-3-031-75823-2_23] | [BibTeX]
Lars Lenssen and Erich Schubert.
Medoid Silhouette clustering with automatic cluster number selection
In: Information Systems 120, 102290, 2024.
[DOI: 10.1016/J.IS.2023.102290] | [preprint (arXiv)] | [BibTeX]

2023

Melanie Derksen, Julia Becker, Mohammad Fazleh Elahi, Angelika Maier, Marius Maile, Ingo Pätzold, Jonas Penningroth, Bettina Reglin, Markus Rothgänger, Philipp Cimiano, Erich Schubert, Silke Schwandt, Thorsten W. Kuhlen, Mario Botsch and Tim Weisske.
Who Did What When? Discovering Complex Historical Interrelations in Immersive Virtual Reality
In: IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2023, Sydney, Australia, 2023.
[DOI: 10.1109/ISMAR59233.2023.00027] | [preprint] | [BibTeX]
Andreas Lang and Erich Schubert.
Accelerating k-Means Clustering with Cover Trees
In: Proceedings of the 16th International Conference on Similarity Search and Applications (SISAP), A Coruna, Spain, 2023.
[DOI: 10.1007/978-3-031-46994-7_13] | [BibTeX]
Erik Thordsen and Erich Schubert.
An Alternating Optimization Scheme for Binary Sketches for Cosine Similarity Search
In: Proceedings of the 16th International Conference on Similarity Search and Applications (SISAP), A Coruna, Spain, 2023, best student paper award + best paper award.
[DOI: 10.1007/978-3-031-46994-7_4] | [BibTeX]
Erich Schubert.
Stop using the elbow criterion for k-means and how to choose the number of clusters instead
In: ACM SIGKDD Explorations 25(1), 36--42, 2023.
[DOI: 10.1145/3606274.3606278] | [preprint] | [BibTeX]
Erich Schubert and Andreas Lang.
Data Aggregation for Hierarchical Clustering
In: Machine Learning under Resource Constraints -- Fundamentals 1, 215-226, 2023.
[DOI: 10.1515/9783110785944-005] | [preprint (arXiv)] | [BibTeX]
Lars Lenssen and Erich Schubert.
Sparse Partitioning Around Medoids
In: Machine Learning under Resource Constraints -- Fundamentals 1, 182-196, 2023.
[DOI: 10.1515/9783110785944-005] | [preprint (arXiv)] | [BibTeX]
Lars Lenssen, Niklas Strahmann and Erich Schubert.
Fast k-Nearest-Neighbor-Consistent Clustering
In: Proceedings of the Conference "Lernen, Wissen, Daten, Analysen" (LWDA), Marburg, Germany, 2023, KDML best paper award.
[BibTeX]

2022

Lars Lenssen and Erich Schubert.
Clustering by Direct Optimization of the Medoid Silhouette
In: Similarity Search and Applications - 15th International Conference, SISAP 2022, Bologna, Italy, October 5-7, 2022, Proceedings, 190-204, 2022, best student paper award.
[DOI: 10.1007/978-3-031-17849-8_15] | [Preprint (arXiv)] | [BibTeX]
Erich Schubert.
Automatic Indexing for Similarity Search in ELKI
In: Similarity Search and Applications - 15th International Conference, SISAP 2022, Bologna, Italy, October 5-7, 2022, Proceedings, 205-213, 2022.
[DOI: 10.1007/978-3-031-17849-8_16] | [BibTeX]
Erik Thordsen and Erich Schubert.
On Projections to Linear Subspaces
In: Similarity Search and Applications - 15th International Conference, SISAP 2022, Bologna, Italy, October 5-7, 2022, Proceedings, 75-88, 2022.
[DOI: 10.1007/978-3-031-17849-8_7] | [Preprint (arXiv)] | [BibTeX]
Franka Bause, Erich Schubert and Nils M. Kriege.
EmbAssi: embedding assignment costs for similarity search in large graph databases
In: Data Min. Knowl. Discov. 36(5), 1728-1755, 2022.
[DOI: 10.1007/s10618-022-00850-3] | [BibTeX]
Andreas Lang and Erich Schubert.
BETULA: Fast clustering of large data with improved BIRCH CF-Trees
In: Inf. Syst. 108, 101918, 2022.
[DOI: 10.1016/j.is.2021.101918] | [BibTeX]
Erik Thordsen and Erich Schubert.
ABID: Angle Based Intrinsic Dimensionality - Theory and analysis
In: Inf. Syst. 108, 101989, 2022.
[DOI: 10.1016/j.is.2022.101989] | [BibTeX]
Erich Schubert and Lars Lenssen.
Fast k-medoids Clustering in Rust and Python
In: J. Open Source Softw. 7(75), 4183, 2022.
[DOI: 10.21105/joss.04183] | [BibTeX]
Erich Schubert and Lars Lenssen.
Fast k-medoids Clustering in Rust and Python
Open-Source software, Zenodo, 2022.
[DOI: 10.5281/zenodo.6802320] | [BibTeX]
Daniel Boiar, Thomas Liebig and Erich Schubert.
LOSDD: Leave-Out Support Vector Data Description for Outlier Detection
In: CoRR abs/2212.13626, 2022.
[DOI: 10.48550/arXiv.2212.13626] | [BibTeX]

2021

Franka Bause, David B. Blumenthal, Erich Schubert and Nils M. Kriege.
Metric Indexing for Graph Similarity Search
In: Similarity Search and Applications - 14th International Conference, SISAP 2021, Dortmund, Germany, September 29 - October 1, 2021, Proceedings, 323-336, 2021.
[DOI: 10.1007/978-3-030-89657-7_24] | [Preprint (arXiv)] | [BibTeX]
Erich Schubert.
A Triangle Inequality for Cosine Similarity
In: Similarity Search and Applications - 14th International Conference, SISAP 2021, Dortmund, Germany, September 29 - October 1, 2021, Proceedings, 32-44, 2021.
[DOI: 10.1007/978-3-030-89657-7_3] | [Preprint (arXiv)] | [BibTeX]
Erich Schubert, Andreas Lang and Gloria Feher.
Accelerating Spherical k-Means
In: Similarity Search and Applications - 14th International Conference, SISAP 2021, Dortmund, Germany, September 29 - October 1, 2021, Proceedings, 217-231, 2021.
[DOI: 10.1007/978-3-030-89657-7_17] | [Preprint (arXiv)] | [BibTeX]
Erik Thordsen and Erich Schubert.
MESS: Manifold Embedding Motivated Super Sampling
In: Similarity Search and Applications - 14th International Conference, SISAP 2021, Dortmund, Germany, September 29 - October 1, 2021, Proceedings, 232-246, 2021.
[DOI: 10.1007/978-3-030-89657-7_18] | [Preprint (arXiv)] | [BibTeX]
Erich Schubert and Peter J. Rousseeuw.
Fast and eager k-medoids clustering: O(k) runtime improvement of the PAM, CLARA, and CLARANS algorithms
In: Inf. Syst. 101, 101804, 2021.
[DOI: 10.1016/j.is.2021.101804] | [BibTeX]
Karin Boczek, Erich Schubert, Jonas Rieger, Carsten Jentsch, Henrik Müller and Jörg Rahnenführer.
Automatisierter Programmauftrag: Auswirkungen algorithmischer Gewichtungen auf die Verbreitung von Journalismus in öffentlich-rechtlichen Mediatheken
In: Innovationen im Journalismus -- Jahrestagung FG Journalistik / Journalismusforschung der DGPuK, 2021, abstract, presentation.
[BibTeX]
Karin Boczek, Erich Schubert, Jonas Rieger, Carsten Jentsch, Henrik Müller and Jörg Rahnenführer.
Mapping the news one transcript at a time: Using subtitle data from streaming services to analyse journalistic coverage
In: Future of Journalism Conference, Cardiff, 2021, abstract, presentation.
[BibTeX]
Erich Schubert.
HACAM: Hierarchical Agglomerative Clustering Around Medoids - and its Limitations
In: Proceedings of the LWDA 2021 Workshops: FGWM, KDML, FGWI-BIA, and FGIR, Online, September 1-3, 2021, 191-204, 2021.
[online] | [BibTeX]
Erik Thordsen and Erich Schubert.
CANDLE: Classification And Noise Detection With Local Embedding Approximations
In: Proceedings of the LWDA 2021 Workshops: FGWM, KDML, FGWI-BIA, and FGIR, Online, September 1-3, 2021, 219-231, 2021.
[online] | [BibTeX]

2020

Irena Koprinska, Michael Kamp, Annalisa Appice, Corrado Loglisci, Luiza Antonie, Albrecht Zimmermann, Riccardo Guidotti, Özlem Özgöbek, Rita P. Ribeiro, Ricard Gavaldà, João Gama, Linara Adilova, Yamuna Krishnamurthy, Pedro M. Ferreira, Donato Malerba, Ibéria Medeiros, Michelangelo Ceci, Giuseppe Manco, Elio Masciari, Zbigniew W. Ras, Peter Christen, Eirini Ntoutsi, Erich Schubert, Arthur Zimek, Anna Monreale, Przemyslaw Biecek, Salvatore Rinzivillo, Benjamin Kille, Andreas Lommatzsch and Jon Atle Gulla (eds).
ECML PKDD 2020 Workshops - Workshops of the European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD 2020): SoGood 2020, PDFL 2020, MLCS 2020, NFMCP 2020, DINA 2020, EDML 2020, XKDD 2020 and INRA 2020, Ghent, Belgium, September 14-18, 2020, Proceedings
Communications in Computer and Information Science 1323, 2020.
[DOI: 10.1007/978-3-030-65965-3] | [BibTeX]
Andreas Lang and Erich Schubert.
BETULA: Numerically Stable CF-Trees for BIRCH Clustering
In: Similarity Search and Applications - 13th International Conference, SISAP 2020, Copenhagen, Denmark, September 30 - October 2, 2020, Proceedings, 281-296, 2020.
[DOI: 10.1007/978-3-030-60936-8_22] | [Preprint (arXiv)] | [BibTeX]
Erik Thordsen and Erich Schubert.
ABID: Angle Based Intrinsic Dimensionality
In: Similarity Search and Applications - 13th International Conference, SISAP 2020, Copenhagen, Denmark, September 30 - October 2, 2020, Proceedings, 218-232, 2020.
[DOI: 10.1007/978-3-030-60936-8_17] | [Preprint (arXiv)] | [BibTeX]
Eirini Ntoutsi, Erich Schubert, Arthur Zimek and Albrecht Zimmermann.
Call for Special Issue Papers: Evaluation and Experimental Design in Data Mining and Machine Learning
In: Big Data 8(4), 253-254, 2020.
[DOI: 10.1089/big.2020.29037.cfp] | [BibTeX]

2019

Erich Schubert and Peter J. Rousseeuw.
Faster k-Medoids Clustering: Improving the PAM, CLARA, and CLARANS Algorithms
In: Proceedings of the 12th International Conference on Similarity Search and Applications (SISAP), Newark, NJ, 171-187, 2019.
[DOI: 10.1007/978-3-030-32047-8_16] | [preprint (arXiv)] | [BibTeX]
Laurent Amsaleg, Michael E. Houle and Erich Schubert.
Introduction to Special Issue of the 9th International Conference on Similarity Search and Applications (SISAP 2016)
In: Inf. Syst. 80, 107, 2019.
[DOI: 10.1016/j.is.2018.11.006] | [BibTeX]
Eirini Ntoutsi, Erich Schubert, Arthur Zimek and Albrecht Zimmermann (eds).
Proceedings of the 1st Workshop on Evaluation and Experimental Design in Data Mining and Machine Learning co-located with SIAM International Conference on Data Mining (SDM 2019), Calgary, Alberta, Canada
CEUR Workshop Proceedings 2436, 2019.
[online] | [BibTeX]
Eirini Ntoutsi, Erich Schubert, Arthur Zimek and Albrecht Zimmermann.
1st Workshop on Evaluation and Experimental Design in Data Mining and Machine Learning (EDML 2019)
In: Proceedings of the 1st Workshop on Evaluation and Experimental Design in Data Mining and Machine Learning co-located with SIAM International Conference on Data Mining (SDM 2019), Calgary, Alberta, Canada, 1-3, 2019.
[online] | [BibTeX]
Erich Schubert and Arthur Zimek.
ELKI: A large open-source library for data analysis - ELKI Release 0.7.5 "Heidelberg"
In: CoRR abs/1902.03616, 2019.
[online] | [BibTeX]

2018

Michael E. Houle, Erich Schubert and Arthur Zimek.
On the Correlation Between Local Intrinsic Dimensionality and Outlierness
In: Proceedings of the 11th International Conference on Similarity Search and Applications (SISAP), Lima, Peru, 177-191, 2018.
[DOI: 10.1007/978-3-030-02224-2_14] | [BibTeX]
Erich Schubert and Michael Gertz.
Numerically Stable Parallel Computation of (Co-)Variance
In: Proceedings of the 30th International Conference on Scientific and Statistical Database Management (SSDBM), Bolzano-Bozen, Italy, 10:1-10:12, 2018, SSDBM 2018 best paper award.
[DOI: 10.1145/3221269.3223036] | [slides (pdf)] | [manuscript (pdf)] | [BibTeX]
Arthur Zimek and Erich Schubert.
Outlier Detection
In: Encyclopedia of Database Systems, Second Edition, 2018.
[DOI: 10.1007/978-1-4614-8265-9_80719] | [BibTeX]
Erich Schubert, Andreas Spitz and Michael Gertz.
Exploring Significant Interactions in Live News
In: Proceedings of the 2nd International Workshop on Recent Trends in News Information Retrieval (NewsIR'18) co-located with 40th European Conference on Information Retrieval (ECIR 2018), Grenoble, France, 39-44, 2018.
[open-access (CEUR-WS)] | [BibTeX]
Erich Schubert and Michael Gertz.
Improving the Cluster Structure Extracted from OPTICS Plots
In: Proceedings of the Conference "Lernen, Wissen, Daten, Analysen" (LWDA), Mannheim, Germany, 318-329, 2018.
[open-access (CEUR-WS)] | [code] | [BibTeX]
Erich Schubert, Sibylle Hess and Katharina Morik.
The Relationship of DBSCAN to Matrix Factorization and Spectral Clustering
In: Proceedings of the Conference "Lernen, Wissen, Daten, Analysen" (LWDA), Mannheim, Germany, 330-334, 2018.
[online] | [open-access (CEUR-WS)] | [BibTeX]

2017

Evelyn Kirner, Erich Schubert and Arthur Zimek.
Good and Bad Neighborhood Approximations for Outlier Detection Ensembles
In: Proceedings of the 10th International Conference on Similarity Search and Applications (SISAP), Munich, Germany, 173-187, 2017.
[DOI: 10.1007/978-3-319-68474-1_12] | [slides (pdf)] | [manuscript (pdf)] | [code] | [BibTeX]
Erich Schubert and Michael Gertz.
Intrinsic t-Stochastic Neighbor Embedding for Visualization and Outlier Detection - A Remedy Against the Curse of Dimensionality?
In: Proceedings of the 10th International Conference on Similarity Search and Applications (SISAP), Munich, Germany, 188-203, 2017.
[DOI: 10.1007/978-3-319-68474-1_13] | [slides (pdf)] | [manuscript (pdf)] | [code] | [BibTeX]
Hans-Peter Kriegel, Erich Schubert and Arthur Zimek.
The (black) art of runtime evaluation: Are we comparing algorithms or implementations?
In: Knowledge and Information Systems (KAIS) 52(2), 341-378, 2017, first online 2016.
[DOI: 10.1007/s10115-016-1004-2] | [authorized access (Springer)] | [BibTeX]
Guillaume Casanova, Elias Englmeier, Michael E. Houle, Peer Kröger, Michael Nett, Erich Schubert and Arthur Zimek.
Dimensional Testing for Reverse k-Nearest Neighbor Search
In: Proc. VLDB Endow. 10(7), 769-780, 2017.
[DOI: 10.14778/3067421.3067426] | [online] | [BibTeX]
Erich Schubert, Jörg Sander, Martin Ester, Hans-Peter Kriegel and Xiaowei Xu.
DBSCAN Revisited, Revisited: Why and How You Should (Still) Use DBSCAN
In: ACM Trans. Database Syst. (TODS) 42(3), 19:1-19:21, 2017.
[DOI: 10.1145/3068335] | [authorized access (ACM)] | [BibTeX]
Erich Schubert, Andreas Spitz, Michael Weiler, Johanna Geiß and Michael Gertz.
Semantic Word Clouds with Background Corpus Normalization and t-distributed Stochastic Neighbor Embedding
In: CoRR abs/1708.03569, 2017.
[open-access (arXiv)] | [BibTeX]

2016

Laurent Amsaleg, Michael E. Houle and Erich Schubert (eds).
Similarity Search and Applications - 9th International Conference, SISAP 2016, Tokyo, Japan, October 24-26, 2016. Proceedings
Lecture Notes in Computer Science 9939, 2016.
[DOI: 10.1007/978-3-319-46759-7] | [conference homepage] | [BibTeX]
Erich Schubert, Michael Weiler and Hans-Peter Kriegel.
SPOTHOT: Scalable Detection of Geo-spatial Events in Large Textual Streams
In: Proceedings of the 28th International Conference on Scientific and Statistical Database Management (SSDBM), Budapest, Hungary, 8:1-8:12, 2016.
[DOI: 10.1145/2949689.2949699] | [authorized access (ACM)] | [preprint (pdf)] | [BibTeX]
Guilherme O. Campos, Arthur Zimek, Jörg Sander, Ricardo J. G. B. Campello, Barbora Micenková, Erich Schubert, Ira Assent and Michael E. Houle.
On the Evaluation of Unsupervised Outlier Detection: Measures, Datasets, and an Empirical Study
In: Data Mining and Knowledge Discovery 30(4), 891-927, 2016, Awarded ``ACM Computing Reviews Notable Books and Articles 2016''.
[DOI: 10.1007/s10618-015-0444-8] | [authorized access (Springer)] | [data and results] | [BibTeX]
Guilherme O. Campos, Arthur Zimek, Jörg Sander, Ricardo J. G. B. Campello, Barbora Micenková, Erich Schubert, Ira Assent and Michael E. Houle.
On the Evaluation of Outlier Detection: Measures, Datasets, and an Empirical Study Continued
In: Proceedings of the Conference "Lernen, Wissen, Daten, Analysen" (LWDA), Potsdam, Germany, 2016.
[ abstract (pdf)] | [ slides (pdf)] | [ poster (pdf)] | [data and results] | [BibTeX]
Erich Schubert, Michael Weiler and Hans-Peter Kriegel.
Scalable Detection of Emerging Topics and Geo-spatial Events in Large Textual Streams
In: Proceedings of the Conference "Lernen, Wissen, Daten, Analysen" (LWDA), Potsdam, Germany, 2016.
[ abstract (pdf)] | [ slides (pdf)] | [poster (pdf)] | [BibTeX]

2015

Erich Schubert, Arthur Zimek and Hans-Peter Kriegel.
Fast and Scalable Outlier Detection with Approximate Nearest Neighbor Ensembles
In: Proceedings of the 20th International Conference on Database Systems for Advanced Applications (DASFAA), Hanoi, Vietnam, 19-36, 2015.
[DOI: 10.1007/978-3-319-18123-3_2] | [preprint (pdf)] | [slides (pdf)] | [code] | [BibTeX]
Erich Schubert, Michael Weiler and Arthur Zimek.
Outlier Detection and Trend Detection: Two Sides of the Same Coin
In: 1st International Workshop on Event Analytics using Social Media Data at the 15th IEEE International Conference on Data Mining (ICDM), Atlantic City, NJ, 40-46, 2015.
[DOI: 10.1109/ICDMW.2015.79] | [preprint (pdf)] | [BibTeX]
Erich Schubert, Alexander Koos, Tobias Emrich, Andreas Züfle, Klaus Arthur Schmid and Arthur Zimek.
A Framework for Clustering Uncertain Data
In: Proc. VLDB Endow. 8(12), 1976-1979, 2015.
[DOI: 10.14778/2824032.2824115] | [online] | [open-access (VLDB)] | [code] | [BibTeX]
Erich Schubert and OpenStreetMap Contributors.
Fast Reverse Geocoder using OpenStreetMap data
Open Data LMU, 2015.
[DOI: 10.5282/ubm/data.61] | [code] | [data] | [BibTeX]

2014

Xuan-Hong Dang, Ira Assent, Raymond T. Ng, Arthur Zimek and Erich Schubert.
Discriminative Features for Identifying and Interpreting Outliers
In: Proceedings of the 30th International Conference on Data Engineering (ICDE), Chicago, IL, 88-99, 2014.
[DOI: 10.1109/ICDE.2014.6816642] | [preprint (pdf)] | [BibTeX]
Erich Schubert, Michael Weiler and Hans-Peter Kriegel.
SigniTrend: Scalable Detection of Emerging Topics in Textual Streams by Hashed Significance Thresholds
In: Proceedings of the 20th ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), New York, NY, 871-880, 2014, Included in Wang, Wei. ``Data Science for Social Good - 2014 KDD Highlights.'' AAAI. 2015..
[DOI: 10.1145/2623330.2623740] | [authorized access (ACM)] | [preprint (pdf)] | [slides (pdf)] | [online demo (static)] | [BibTeX]
Erich Schubert, Arthur Zimek and Hans-Peter Kriegel.
Generalized Outlier Detection with Flexible Kernel Density Estimates
In: Proceedings of the 14th SIAM International Conference on Data Mining (SDM), Philadelphia, PA, 542-550, 2014.
[DOI: 10.1137/1.9781611973440.63] | [preprint (pdf)] | [code] | [BibTeX]
Erich Schubert, Arthur Zimek and Hans-Peter Kriegel.
Local Outlier Detection Reconsidered: a Generalized View on Locality with Applications to Spatial, Video, and Network Outlier Detection
In: Data Mining and Knowledge Discovery 28(1), 190-237, 2014, first online 2012.
[DOI: 10.1007/s10618-012-0300-z] | [authorized access (Springer)] | [code] | [BibTeX]

2013

Elke Achtert, Hans-Peter Kriegel, Erich Schubert and Arthur Zimek.
Interactive Data Mining with 3D-Parallel-Coordinate-Trees
In: Proceedings of the ACM International Conference on Management of Data (SIGMOD), New York City, NY, 1009-1012, 2013.
[DOI: 10.1145/2463676.2463696] | [ELKI] | [authorized access (ACM)] | [BibTeX]
Erich Schubert, Arthur Zimek and Hans-Peter Kriegel.
Geodetic Distance Queries on R-Trees for Indexing Geographic Data
In: Proceedings of the 13th International Symposium on Spatial and Temporal Databases (SSTD), Munich, Germany, 146-164, 2013.
[DOI: 10.1007/978-3-642-40235-7_9] | [code] | [BibTeX]
Erich Schubert.
Generalized and Efficient Outlier Detection for Spatial, Temporal, and High-Dimensional Data Mining
Ludwig-Maximilians-Universität München, Munich, Germany, 2013.
[online] | [Universitätsbibliothek] | [BibTeX]
Arthur Zimek, Erich Schubert and Hans-Peter Kriegel.
Outlier Detection in High-Dimensional Data
Tutorial at the 17th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), Gold Coast, Australia, 2013.
[slides (pdf)] | [BibTeX]

2012

Elke Achtert, Sascha Goldhofer, Hans-Peter Kriegel, Erich Schubert and Arthur Zimek.
Evaluation of Clusterings -- Metrics and Visual Support
In: Proceedings of the 28th International Conference on Data Engineering (ICDE), Washington, DC, 1285-1288, 2012.
[DOI: 10.1109/ICDE.2012.128] | [ELKI] | [BibTeX]
Hans-Peter Kriegel, Peer Kröger, Erich Schubert and Arthur Zimek.
Outlier Detection in Arbitrarily Oriented Subspaces
In: Proceedings of the 12th IEEE International Conference on Data Mining (ICDM), Brussels, Belgium, 379-388, 2012.
[DOI: 10.1109/ICDM.2012.21] | [code] | [BibTeX]
Erich Schubert, Remigius Wojdanowski, Arthur Zimek and Hans-Peter Kriegel.
On Evaluation of Outlier Rankings and Outlier Scores
In: Proceedings of the 12th SIAM International Conference on Data Mining (SDM), Anaheim, CA, 1047-1058, 2012.
[DOI: 10.1137/1.9781611972825.90] | [code] | [BibTeX]
Arthur Zimek, Erich Schubert and Hans-Peter Kriegel.
A Survey on Unsupervised Outlier Detection in High-Dimensional Numerical Data
In: Statistical Analysis and Data Mining 5(5), 363-387, 2012, Included in the ``most accessed papers from Statistical Analysis and Data Mining'' 2014--2016 r̆lhttps://onlinelibrary.wiley.com/page/journal/19321872/homepage/MostAccessed.html.
[DOI: 10.1002/sam.11161] | [more information] | [BibTeX]
Arthur Zimek, Erich Schubert and Hans-Peter Kriegel.
Tutorial I: Outlier Detection in High-Dimensional Data
In: Proceedings of the 12th IEEE International Conference on Data Mining (ICDM), Brussels, Belgium, xxx-xxxii, 2012.
[DOI: 10.1109/ICDM.2012.9] | [slides (pdf)] | [BibTeX]

2011

Hans-Peter Kriegel, Peer Kröger, Erich Schubert and Arthur Zimek.
Interpreting and Unifying Outlier Scores
In: Proceedings of the 11th SIAM International Conference on Data Mining (SDM), Mesa, AZ, 13-24, 2011.
[DOI: 10.1137/1.9781611972818.2] | [preprint (pdf)] | [code] | [BibTeX]
Elke Achtert, Ahmed Hettab, Hans-Peter Kriegel, Erich Schubert and Arthur Zimek.
Spatial Outlier Detection: Data, Algorithms, Visualizations
In: Proceedings of the 12th International Symposium on Spatial and Temporal Databases (SSTD), Minneapolis, MN, 512-516, 2011, Best Demonstration Paper Award.
[DOI: 10.1007/978-3-642-22922-0_41] | [ELKI] | [BibTeX]
Thomas Bernecker, Michael E. Houle, Hans-Peter Kriegel, Peer Kröger, Matthias Renz, Erich Schubert and Arthur Zimek.
Quality of Similarity Rankings in Time Series
In: Proceedings of the 12th International Symposium on Spatial and Temporal Databases (SSTD), Minneapolis, MN, 422-440, 2011.
[DOI: 10.1007/978-3-642-22922-0_25] | [BibTeX]
Hans-Peter Kriegel, Erich Schubert and Arthur Zimek.
Evaluation of Multiple Clustering Solutions
In: 2nd MultiClust Workshop: Discovering, Summarizing and Using Multiple Clusterings Held in Conjunction with ECML PKDD 2011, Athens, Greece, 55-66, 2011.
[online] | [open-access (CEUR-WS)] | [BibTeX]

2010

Elke Achtert, Hans-Peter Kriegel, Lisa Reichert, Erich Schubert, Remigius Wojdanowski and Arthur Zimek.
Visual Evaluation of Outlier Detection Models
In: Proceedings of the 15th International Conference on Database Systems for Advanced Applications (DASFAA), Tsukuba, Japan, 396-399, 2010.
[DOI: 10.1007/978-3-642-12098-5_34] | [ELKI] | [poster] | [BibTeX]
Thomas Bernecker, Tobias Emrich, Franz Graf, Hans-Peter Kriegel, Peer Kröger, Matthias Renz, Erich Schubert and Arthur Zimek.
Subspace Similarity Search Using the Ideas of Ranking and Top-k Retrieval
In: Proceedings of the 26th International Conference on Data Engineering (ICDE) Workshop on Ranking in Databases (DBRank), Long Beach, CA, 4-9, 2010.
[DOI: 10.1109/ICDEW.2010.5452771] | [more information] | [BibTeX]
Thomas Bernecker, Tobias Emrich, Franz Graf, Hans-Peter Kriegel, Peer Kröger, Matthias Renz, Erich Schubert and Arthur Zimek.
Subspace Similarity Search: Efficient k-NN Queries in Arbitrary Subspaces
In: Proceedings of the 22nd International Conference on Scientific and Statistical Database Management (SSDBM), Heidelberg, Germany, 555-564, 2010.
[DOI: 10.1007/978-3-642-13818-8_38] | [preprint (pdf)] | [more information] | [BibTeX]
Michael E. Houle, Hans-Peter Kriegel, Peer Kröger, Erich Schubert and Arthur Zimek.
Can Shared-Neighbor Distances Defeat the Curse of Dimensionality?
In: Proceedings of the 22nd International Conference on Scientific and Statistical Database Management (SSDBM), Heidelberg, Germany, 482-500, 2010.
[DOI: 10.1007/978-3-642-13818-8_34] | [preprint (pdf)] | [supplementary material] | [BibTeX]
Erich Schubert and Arthur Zimek.
ELKI Multi-View Clustering Data Sets Based on the Amsterdam Library of Object Images (ALOI)
Open data, Zenodo, 2010.
[DOI: 10.5281/zenodo.6355684] | [BibTeX]
Ines Färber, Stephan Günnemann, Hans-Peter Kriegel, Peer Kröger, Emmanuel Müller, Erich Schubert, Thomas Seidl and Arthur Zimek.
On Using Class-Labels in Evaluation of Clusterings
In: MultiClust: 1st International Workshop on Discovering, Summarizing and Using Multiple Clusterings Held in Conjunction with KDD 2010, Washington, DC, 2010.
[pdf] | [BibTeX]

2009

Hans-Peter Kriegel, Peer Kröger, Erich Schubert and Arthur Zimek.
LoOP: Local Outlier Probabilities
In: Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM), Hong Kong, China, 1649-1652, 2009.
[DOI: 10.1145/1645953.1646195] | [pdf] | [authorized access (ACM)] | [code] | [BibTeX]
Hans-Peter Kriegel, Peer Kröger, Erich Schubert and Arthur Zimek.
Outlier Detection in Axis-Parallel Subspaces of High Dimensional Data
In: Proceedings of the 13th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), Bangkok, Thailand, 831-838, 2009.
[DOI: 10.1007/978-3-642-01307-2_86] | [pdf] | [slides] | [code] | [BibTeX]
Elke Achtert, Thomas Bernecker, Hans-Peter Kriegel, Erich Schubert and Arthur Zimek.
ELKI in Time: ELKI 0.2 for the Performance Evaluation of Distance Measures for Time Series
In: Proceedings of the 11th International Symposium on Spatial and Temporal Databases (SSTD), Aalborg, Denmark, 436-440, 2009.
[DOI: 10.1007/978-3-642-02982-0_35] | [ELKI] | [pdf] | [poster] | [BibTeX]

2008

Hans-Peter Kriegel, Peer Kröger, Erich Schubert and Arthur Zimek.
A General Framework for Increasing the Robustness of PCA-Based Correlation Clustering Algorithms
In: Proceedings of the 20th International Conference on Scientific and Statistical Database Management (SSDBM), Hong Kong, China, 418-435, 2008.
[DOI: 10.1007/978-3-540-69497-7_27] | [preprint (pdf)] | [code] | [BibTeX]
Erich Schubert.
Statistical Approaches for Robustifying Correlation Clustering Algorithms
Ludwig-Maximilians-Universität München, Munich, Germany, 2008, Diploma thesis.
[BibTeX]

2005

Patrick F. Riley and Erich Schubert.
mReplay: Mobile Sports Replay and Fan Democracy
In: Axmedis 2005: Proceedings of the 1st International conference on Automated production of Cross Media content for Multi-channel distribution, 2005.
[DOI: 10.1400/41109] | [BibTeX]
Erich Schubert, Sebastian Schaffert and François Bry.
Structure-Preserving Difference Search for XML Documents
In: Proceedings of the Extreme Markup Languages 2005 Conference, Montreal, Quebec, Canada, 2005.
[online] | [open-access] | [code] | [BibTeX]
Erich Schubert.
Structure Preserving Difference Search in Semistructured Data
Ludwig-Maximilians-Universität München, Munich, Germany, 2005, Project thesis (undergraduate).
[BibTeX]