Greenstone has developed—rather unfairly, we feel—a reputation as a ‘toy’ document system not capable of handling large-scale, enterprise level collections. While our latest ‘million page’ newspaper collections should help change this preconception, there are indeed some scalability issues encountered in large collections. Similar problems have been encountered in large-scale databases and have been answered by the use of distributed computing, [...]
Large-scale Greenstone collections using DB2
Greenstone has developed—rather unfairly, we feel—a reputation as a ‘toy’ document system not capable of handling large-scale, enterprise level collections. While our latest ‘million page’ newspaper collections should help change this preconception, there are indeed some scalability issues encountered in large collections. Similar problems have been encountered in large-scale databases and have been answered by the use of distributed computing, [...]