Re: data analysis

new topic     » goto parent     » topic index » view thread      » older message » newer message

On 17 Jan 2001, at 19:15, David Cuny wrote:

> Here's an updated version of the pattern matcher. The output is changed a bit
> (hopefully it's more clear), and I've added some more features. It now checks
> for swapped letters, and prefers removal over resyncing.
>
> Given the test data:
>
>    "this", "that"
>    "actually", "actaully"
>    "Pittsborough", "Pittsburg"
>    "Pittsberg", "Pittsburg"
>    "shello", "hellos"
>    "fhfellos", "shello"
>    "Sweigsdunka", "Zweigsdanka"
>
>
> It produces:
>
>    th[is,at]
>    act[ua,au]lly
>    Pittsb[o,u]r[ou,]g[h,]
>    Pittsb[e,u]rg
>    [s,]hello[,s]
>    [f,s]h[f,]ello[s,]
>    [S,Z]weigsd[u,a]nka

Ok, that breaks the feeding of MinG and MaxG to it. I still don't understand
the resyncing, but maybe i will one day. Did you see how the code i
posted returns a list of results? Comments?

Kat

new topic     » goto parent     » topic index » view thread      » older message » newer message

Search



Quick Links

User menu

Not signed in.

Misc Menu