[ Help index ]

Search box

This is where you place your search phrase.

Click Search when you are ready to start.

It's always a good idea to tst your search string on a couple files so that you can refine it and check for errors. You may want to sample 2 to 3 files at random with the random n files feature and 5 to 10 sample matches per file with the random n matches per file feature.

While the software carries out the search, you have to option of interrupting the process by clicking the Interrupt button. Do this if it seems that the process is taking too long and/or you want to inspect the results before conducting a full search on the corpus.

You use regular expressions to craft your search.

For example:

Any word (for counting unique words per text) \w+
Words ending in 'al' \w+al
Words ending in 'al' or 'on' \w+(?:al|on)
Words ending in a vowel \w+[aeiou]
Words not ending in a vowel \w+[^aeiou]
Words starting with 're' re\w+
Words starting with 're' or 'pre' (?:re|pre)\w+

Wildcards and character sets

Character set [abcde]
Exclusion character set [^abcde]
Digit set [0-9]
Any digit \d
Any non space/any character, including punctuation and numbers \w

Search Finds
habl[eéoó] hable, hablé, hablo, habló
habla[^sn] hablan, hablar, hablad
\d+ 1, 2, 15, 3698, 1494
c\w[ñnm]\w cena, cama, como, caña, ...
¡Sí!{1,} ¡Sí!, ¡Sí!!, ¡Sí!!!, ¡Sí!!!!!, ...

Quantifiers

0 or more *
0 or 1 ?
1 or more +
from n to m {n,m}
n or more {n,}

Search Finds
constituci\wn\w* constitución, constituciones, constitucional, constitucionales, ...
tienes? tiene, tienes
est\w+ estoy, estás, estabas, estuvieron, estar, ...
¡{1,3}[Hh]ola!{1,3} ¡hola!, ¡Hola!, ¡¡hola!!, ¡¡Hola!!, ¡¡¡hola!!!, ¡¡¡Hola!!!, ¡Hola!!!!
¡Sí!{1,} ¡Sí!, ¡Sí!!, ¡Sí!!!, ¡Sí!!!!!, ...

Groups

Choices (?:segment|segment|segment)

Search Finds
estuv(?:e|iste|o|imos|istes|ieron) estuve, estuviste, estuvo, estuvimos, estuvisteis, estuvieron
(?:pre|pos)\w+ pretexto, preponer, posguerra, ...
(?:a|en|de) \w+ a casa, en Roma, de fiesta