RE: [Corpora-List] Grep for Windows

From: Amsler, Robert (Robert.Amsler@hq.doe.gov)
Date: Fri Dec 15 2006 - 17:16:34 MET

    FINDSTR, the Command Prompt (MS-DOS) command, does what you want. It used
    to be quite fast, but it slowed down considerably when Microsoft started
    emulating MS-DOS. Even so, it has the functionality you want. Beware of
    the collating sequence, which can affect what character ranges such as
    [a-z] actually match.
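To illustrate the collating-sequence caveat, here is a minimal Python sketch. It does not call FINDSTR. It only compares a range test by code point with a range test under a hypothetical case-interleaved collation order (a A b B ... z Z). That order, and the helper names, are assumptions made for illustration; check FINDSTR's actual behaviour on your own system.

```python
import string

# Hypothetical collation order that interleaves cases: a A b B ... z Z.
# This is an assumption for illustration, not FINDSTR's real table.
COLLATION = "".join(
    lower + upper
    for lower, upper in zip(string.ascii_lowercase, string.ascii_uppercase)
)


def in_range_codepoint(ch, start, end):
    """Range test by code point, as most regex engines do for [start-end]."""
    return ord(start) <= ord(ch) <= ord(end)


def in_range_collated(ch, start, end, order=COLLATION):
    """Range test by position in a collation order instead of code point."""
    pos = order.find(ch)
    if pos < 0:
        return False  # character is not covered by the illustrative order
    return order.index(start) <= pos <= order.index(end)


letters = string.ascii_letters
codepoint_hits = [c for c in letters if in_range_codepoint(c, "a", "z")]
collated_hits = [c for c in letters if in_range_collated(c, "a", "z")]

print("code point [a-z]:", "".join(codepoint_hits))
print("collated   [a-z]:", "".join(collated_hits))
```

Under this assumed order, the range a-z also takes in uppercase letters, so a pattern meant to match only lowercase text could match more than intended. Listing the wanted characters explicitly instead of using a range avoids depending on the ordering.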

    -----Original Message-----
    From: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no] On
    Behalf Of Mark Davies
    Sent: Friday, December 15, 2006 10:00 AM
    To: corpora@hd.uib.no
    Subject: [Corpora-List] Grep for Windows

    This next semester, I'd like to have the students in my Corpus
    Linguistics class learn to use Grep tools for searching large corpora. I
    know there are many great, fast Unix tools, but these students will be
    using Windows machines. If possible, the program would have the
    following features:

    -- Fast, since they'll be working with fairly large corpora (100 million
    words and more)
    -- Obviously, full regular-expression capability
    -- Runs as a native Windows app rather than under Cygwin or a similar
    program

    I've already looked at PowerGrep, V-Grep, and TextPad, but none of these
    are adequate. Any other suggestions? Thanks in advance.

    Mark Davies

    ============================================
    Mark Davies
    Professor of (Corpus) Linguistics
    Brigham Young University
    (phone) 801-422-9168 / (fax) 801-422-0906
    Web: davies-linguistics.byu.edu

    ** Corpus design and use // Linguistic databases **
    ** Historical linguistics // Language variation **
    ** English, Spanish, and Portuguese **
    ============================================


