|
-
8th March 06, 09:23 AM
#1
I don't know how the windows guys do it, but the Unix guys would do something like this:
"cat thedocument.txt | grep -c theword"
this would give you the count of how many times the text "theword" shows up.
I could write a script that would set up an array of the unique words in the document and then run that with each word variable then dump the results to a text file.
I'm sure the Windows guys have something that will do the same thing with one or two clicks. :rolleyes:
-
-
8th March 06, 09:27 AM
#2
If you're doing things like Shakespeare or the Bible, there are published concordances that you can use. I don't know how many works have accompanying concordances, though.
An uair a théid an gobhainn air bhathal 'se is feàrr a bhi réidh ris.
(When the smith gets wildly excited, 'tis best to agree with him.)
Kiltio Ergo Sum.
I Kilt, therefore I am. -McClef
-
-
8th March 06, 09:50 AM
#3
This class uses a variety of poems and short stories. The poems are fairly easy, but for the stories... I pasted a 4 page story into Word, and used the word search on each word seperately, while at the same time deleting the highlighted results ...it took hours!! ...and I still had to visually search the text for binaries (opposites) and word strands (words with related meanings).
I was just hoping that there would be a site online that would do the word count on anything you pluged into it.
...and yes, I'm using Windows.
If you're doing things like Shakespeare or the Bible, there are published concordances that you can use. I don't know how many works have accompanying concordances, though.
I'm taking a Shakespeare class from the same Prof. I'll keep that in mind if he decides to torture us there too, but I doubt he will. We're only giving two weeks to each play (one night class a week), so we're moving too fast to do any real in-depth analyzing.
-
-
8th March 06, 10:05 AM
#4
Always try to help my Windows using brothers
How does this sound:
Linguistic Inquiry and Word Count (LIWC) is a text analysis software program designed by James W. Pennebaker, Roger J. Booth, and Martha E. Francis. LIWC is able to calculate the degree to which people use different categories of words across a wide array of texts. Within emails, speeches, poems, or transcribed daily speech, LIWC allows you to determine the rate at which the authors/speakers use positive or negative emotion words, self-references, big words, or words that refer to sex, eating, or religion. The program was designed to analyze simply and quickly over 70 dimensions of language across hundreds of text samples in seconds.
Doesn't cost too much either, looks like a cool tool either way.
Link
-
-
8th March 06, 10:07 AM
#5
P.S. I think it does do word count per word.
-
-
8th March 06, 10:23 AM
#6
 Originally Posted by clancelt
Always try to help my Windows using brothers
How does this sound:
Linguistic Inquiry and Word Count (LIWC) is a text analysis software program designed by James W. Pennebaker, Roger J. Booth, and Martha E. Francis. LIWC is able to calculate the degree to which people use different categories of words across a wide array of texts. Within emails, speeches, poems, or transcribed daily speech, LIWC allows you to determine the rate at which the authors/speakers use positive or negative emotion words, self-references, big words, or words that refer to sex, eating, or religion. The program was designed to analyze simply and quickly over 70 dimensions of language across hundreds of text samples in seconds.
Doesn't cost too much either, looks like a cool tool either way.
Link
You're right, looks like an interesting tool, but I finally found this to use for free on-line. It does exactly what I wanted.
-
-
8th March 06, 07:28 PM
#7
 Originally Posted by clancelt
I don't know how the windows guys do it, but the Unix guys would do something like this:
"cat thedocument.txt | grep -c theword"
this would give you the count of how many times the text "theword" shows up.
I could write a script that would set up an array of the unique words in the document and then run that with each word variable then dump the results to a text file.
I'm sure the Windows guys have something that will do the same thing with one or two clicks. :rolleyes:
Even better would be:
tr -sc 'A-Za-z' '\012' <file.txt | sort | uniq -ci | sort -rn | more
Last edited by walkerk; 8th March 06 at 07:31 PM.
-
Posting Permissions
- You may not post new threads
- You may not post replies
- You may not post attachments
- You may not edit your posts
-
Forum Rules
|
|
Bookmarks