Teen WordPress-toteutuksena sivustoa, jossa on pääsivuston lisäksi muutama blogi. Sivusto on sisällöllisesti yhtenäinen kokonaisuus ja blogien halutaan olevan näkyvä osa. Näinpä navigaatio, laidan bannerit ja haku ovat läpi sivuston yhtenäisiä.
Please note: this article predates my company Nettiapina Oy, and may not be accurate.
Bulk of the list was taken from Snowball and DBIx::FullTextSearch::StopList, but I’ve also added some common words that were useless for the website. Your needs may vary. The list is longer than the Swedish one that comes with mnoGoSearch, for example.
Couple of months ago I “inherited” a business website that has considerable amount of user-edited pages. The site was quite slow at times. The usual MySQL optimization tricks helped a little, but the site would still occasionally halt to a crawl. The Slow query log revealed that certain queries often took more than 15 seconds to run. The culprit seemed to be mnoGoSearch, although Googling and digging revealed no obvious reason for the poor performance.
Then it hit me: the system had no Finnish stopword list. The index database had over five million lines, and you could get lots of hits for BS queries like “maybe perhaps” (”kai ehkä”). After compiling the list and rebuilding the index, the number of rows went down to a more tolerable 250 000.
For search engines
suomi, suomeksi, sulkusana, lista, suomenkielinen sulkusanalista, stop word list, stop word list for mnogosearch