cutes and tildes search (feature request)

General discussion of development ideas and the approaches taken in the 3.x branch of phpBB. The next feature release of phpBB 3 will be 3.3/Proteus.
Forum rules
Please do not post support questions regarding installing, updating, or upgrading phpBB 3.2.x. If you need support for phpBB 3.2.x please visit the 3.2.x Support Forum on phpbb.com.

If you have questions regarding writing extensions please post in Extension Writers Discussion to receive proper guidance from our staff and community.
Post Reply
schlabs
Registered User
Posts: 2
Joined: Mon Nov 16, 2015 11:16 am

cutes and tildes search (feature request)

Post by schlabs » Mon Nov 16, 2015 11:31 am

Hi:
First of all, sorry if this is not the appropiate section, please move if is needed.

I am spanish talker, and use the spanish translation very well. I found a little problem when the people try to use the search engine. The problem is in spanish and i imagine in other languages (like portuguese and french) too.

The situation is the follow:
User 1: post "the dog make pipi on tree", in spanish this spell "el perro hace pipi en el árbol"
Please note the tilde in the "á"

When the User 2 use the search engine is normal that write "perro" "arbol" (dog & tree). Please note that the tilde is missing.

The actual result is "not found", but the spected result is "1 found".
Since the the misspellings with tildes are over 50% in spanish world, this become a big problem to found historical post.

So i suggest that the phpbb include a fuzzy search that allow found the user 1 post regardles if speled "árbol" or "arbol" (tree). Google, ebay, alibaba, mercadolibre and other selling sites do it.

The fuzzy table can be:
a=ä=à=á
e=ë=è=é
i=ï=ì=í
o=ö=ò=ó
u=ü=ù=ú
c=ç ( to be corrected, this is used in catalan, french and portuguese but not in spanish).

thanks you in advance.

User avatar
Pony99CA
Registered User
Posts: 986
Joined: Sun Feb 08, 2009 2:35 am
Location: Hollister, CA
Contact:

Re: cutes and tildes search (feature request)

Post by Pony99CA » Thu Nov 19, 2015 1:58 am

Just FYI, tilde is the "~" symbol, often (only?) found in Spanish above an "n". You didn't ask for a mapping of that.

You're referring to accent marks on the "a", except for the mark below the "c", which is a cedilla.

I have no idea what "cutes" are, unless that's short for "acute".

However, search mapping probably depends on what search method you're using and possibly what language your database uses. As I'm not an expert on the search methods that phpBB supports, I'll let somebody else comment on their internationalization support.

Another thing to watch out for is sort orders. If searches aren't localized, you may get various national characters sorting out of order for the language in use.

Steve
Silicon Valley Pocket PC (http://www.svpocketpc.com)
Creator of manage_bots and spoof_user (ask me)
Need hosting for a small forum with full cPanel & MySQL access? Contact me or PM me.

schlabs
Registered User
Posts: 2
Joined: Mon Nov 16, 2015 11:16 am

Re: cutes and tildes search (feature request)

Post by schlabs » Thu Nov 19, 2015 2:53 pm

Is right, "cutes" is bad spelled. I am refering to accents and cedillas. Sorry i dont know how traduce to english and write cutes and tildes.

I agree with you, each language need their own fuzzy search.

Post Reply