Datapartitioning using the Hilbert space filling curves: effect on the speed of convergence of Fuzzy ARTMAP for large database problems.
 Citation data:

Neural networks : the official journal of the International Neural Network Society, ISSN: 08936080, Vol: 18, Issue: 7, Page: 96784
 Publication Year:
 2005
 Repository URL:
 http://stars.library.ucf.edu/facultybib2000/5044; http://stars.library.ucf.edu/facultybib/7309
 PMID:
 15922562
 DOI:
 10.1016/j.neunet.2005.01.007
 Author(s):
 Publisher(s):
 Tags:
 Neuroscience; Computer Science; FuzzyARTMAP; Hilbert spacefilling curve; data mining; datapartitioning; NEURALNETWORK; CLASSIFICATION; ARCHITECTURE; RECOGNITION; Computer Science; Artificial Intelligence
article description
The Fuzzy ARTMAP algorithm has been proven to be one of the premier neural network architectures for classification problems. One of the properties of Fuzzy ARTMAP, which can be both an asset and a liability, is its capacity to produce new nodes (templates) on demand to represent classification categories. This property allows Fuzzy ARTMAP to automatically adapt to the database without having to a priori specify its network size. On the other hand, it has the undesirable side effect that large databases might produce a large network size (node proliferation) that can dramatically slow down the training speed of the algorithm. To address the slow convergence speed of Fuzzy ARTMAP for large database problems, we propose the use of spacefilling curves, specifically the Hilbert spacefilling curves (HSFC). Hilbert spacefilling curves allow us to divide the problem into smaller subproblems, each focusing on a smaller than the original dataset. For learning each partition of data, a different Fuzzy ARTMAP network is used. Through this divideandconquer approach we are avoiding the node proliferation problem, and consequently we speedup Fuzzy ARTMAP's training. Results have been produced for a twoclass, 16dimensional Gaussian data, and on the Forest database, available at the UCI repository. Our results indicate that the Hilbert spacefilling curve approach reduces the time that it takes to train Fuzzy ARTMAP without affecting the generalization performance attained by Fuzzy ARTMAP trained on the original large dataset. Given that the resulting smaller datasets that the HSFC approach produces can independently be learned by different Fuzzy ARTMAP networks, we have also implemented and tested a parallel implementation of this approach on a Beowulf cluster of workstations that further speeds up Fuzzy ARTMAP's convergence to a solution for large database problems.