Clustering validation by distribution hypothesis learning
Statistics and Computing, ISSN: 1573-1375, Vol: 34, Issue: 6
2024
- 1Mentions
Metric Options: CountsSelecting the 1-year or 3-year option will change the metrics count to percentiles, illustrating how an article or review compares to other articles or reviews within the selected time period in the same journal. Selecting the 1-year option compares the metrics against other articles/reviews that were also published in the same calendar year. Selecting the 3-year option compares the metrics against other articles/reviews that were also published in the same calendar year plus the two years prior.
Example: if you select the 1-year option for an article published in 2019 and a metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019. If you select the 3-year option for the same article published in 2019 and the metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019, 2018 and 2017.
Citation Benchmarking is provided by Scopus and SciVal and is different from the metrics context provided by PlumX Metrics.
Example: if you select the 1-year option for an article published in 2019 and a metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019. If you select the 3-year option for the same article published in 2019 and the metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019, 2018 and 2017.
Citation Benchmarking is provided by Scopus and SciVal and is different from the metrics context provided by PlumX Metrics.
Metrics Details
- Mentions1
- News Mentions1
- News1
Most Recent News
New Data from National Scientific and Technical Research Council (CONICET) Illuminate Findings in Statistics and Computing (Clustering Validation By Distribution Hypothesis Learning)
2024 DEC 04 (NewsRx) -- By a News Reporter-Staff News Editor at Computer News Today -- Data detailed on Statistics and Computing have been presented.
Article Description
We present a new clustering validation technique named: “Hypothesis Learning”. We build our method on three concepts: (1) clustering cohesion, (2) clustering dispersion and, (3) hypothesis quality. The first two notions focus on individual cluster quality. We measure them using a classifier estimating the tightness and separation as a likelihood. The third notion evaluates the complexity of learning the clustering partition. Similar to cohesion and dispersion, we get a likelihood value. Next, we aggregate these three measures to find a single index reporting clustering quality. Previous methods from the literature have already used supervised and unsupervised algorithms and stability concepts to validate clustering solutions. Our motivation is not only to improve these methods but to use learning algorithms in a novel manner to learn key clustering concepts such as cohesion and dispersion. Furthermore, we include a technical discussion on how to regularize a classifier to handle overfit, thus explaining the symbiosis between supervised and unsupervised algorithms. In our experimental setup, we tested “Hypothesis Learning” with a fast classifier, K Nearest Neighbour (KNN). However, in the discussion of the method, we explore other classifiers like CART and Random Forest. The experimental results compare our approach with a similar method and many other well-known clustering indexes.
Bibliographic Details
Springer Science and Business Media LLC
Provide Feedback
Have ideas for a new metric? Would you like to see something else here?Let us know