Multisite search for WordPress

I am building a WordPress site that has a few blogs in addition to the main site. In terms of content the site is a single whole, and the blogs are meant to be a visible part of it. The navigation, the sidebar banners and the search are therefore consistent across the entire site.

I tested a couple of search plugins and ended up trying the Finnish-made Relevanssi, which promises to improve the search results and also offers a multisite search feature. [Read more...]

Finnish stopword list

Please note: this article predates my company Nettiapina Oy, and may not be accurate.

This is a Finnish stopword list that I’m using with the mnoGoSearch engine.

The bulk of the list was taken from Snowball and DBIx::FullTextSearch::StopList, but I’ve also added some common words that were useless for the website. Your needs may vary. The list is longer than the Swedish one that comes with mnoGoSearch, for example.
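If you want to compile a similar list yourself, the merging step can be sketched roughly like this in Python. This is only an illustration, not the script I used: the function names and the sample words are made up, and it assumes each source has one word per line with optional `|` comments, which is how the Snowball lists are laid out.

```python
def parse_stopword_lines(lines):
    """Yield lowercase words from lines, skipping blanks and '|' comments."""
    for line in lines:
        # Assumption: anything after '|' is a comment, as in Snowball's lists.
        word = line.split("|", 1)[0].strip().lower()
        if word:
            yield word


def merge_stopword_lists(*sources):
    """Merge several iterables of lines into one sorted, de-duplicated list."""
    merged = set()
    for source in sources:
        merged.update(parse_stopword_lines(source))
    return sorted(merged)


# Hypothetical sample data standing in for the real source lists.
snowball_like = ["ja   | and", "ei   | no", "", "että | that"]
other_list = ["ja", "Kai", "ehkä"]
site_specific = ["etusivu"]  # a word that is useless on one particular site

stopwords = merge_stopword_lists(snowball_like, other_list, site_specific)
print(stopwords)
```

In practice you would read the source lists from files and write the result out in whatever format your search engine expects.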

Download Finnish stopword list

Background

A couple of months ago I “inherited” a business website that has a considerable number of user-edited pages. The site was quite slow at times. The usual MySQL optimization tricks helped a little, but the site would still occasionally slow to a crawl. The slow query log revealed that certain queries often took more than 15 seconds to run. The culprit seemed to be mnoGoSearch, although Googling and digging revealed no obvious reason for the poor performance.

Then it hit me: the system had no Finnish stopword list. The index database had over five million rows, and you could get lots of hits for BS queries like “maybe perhaps” (“kai ehkä”). After compiling the list and rebuilding the index, the number of rows went down to a more tolerable 250,000.
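To see why a stopword list shrinks the index so much, here is a toy model in Python. It is a simplified sketch and not how mnoGoSearch actually stores its index: it assumes one row per (word, document) pair, and the documents and the stopword set are invented for the example.

```python
from collections import defaultdict


def build_index(documents, stopwords=frozenset()):
    """Map each non-stopword to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in text.lower().split():
            if word not in stopwords:
                index[word].add(doc_id)
    return index


def count_rows(index):
    """Count (word, document) pairs, i.e. rows in this simplified model."""
    return sum(len(doc_ids) for doc_ids in index.values())


# Hypothetical documents; the filler words appear in almost every one.
documents = {
    1: "kai se on ehkä hyvä",
    2: "ehkä se on kai huono",
    3: "hyvä haku on nopea",
}
stopwords = frozenset({"kai", "ehkä", "se", "on"})

full = build_index(documents)
filtered = build_index(documents, stopwords)
print(count_rows(full), count_rows(filtered))
```

Filler words occur in nearly every document, so each one adds a row per document. Dropping them removes most of the rows while leaving the words people actually search for.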

For search engines
suomi, suomeksi, sulkusana, lista, suomenkielinen sulkusanalista, stop word list, stop word list for mnogosearch