Ivorde Unix/Linux/Database/Web/Mail Forum

View unanswered posts
View active topics
It is currently Wed Feb 08, 2012 8:15 am


Tips & Tricks, Questions regarding shell scripts, awk, perl, sed and much more.

Bookmark this article:

Author Message
LaR3
Post  Post subject: How to convert a text into a list of words, one per line  |  Posted: Wed Aug 05, 2009 2:02 pm

Joined: Mon Aug 03, 2009 11:55 pm
Posts: 19

Offline
Save on Delicious
Depending on the type of processing you need to do on any file, it's possible you meight need to convert it into a list of words, with one word per line.

Code:
# cat test.file
FreeBSD 7.2-RELEASE is now available for the amd64, i386, ia64, pc98, powerpc, and sparc64 architectures.

FreeBSD 7.2 can be installed from bootable ISO images or over the network; the required files can be downloaded via FTP or BitTorrent as described in the sections below. While some of the smaller FTP mirrors may not carry all architectures, they will all generally contain the more common ones, such as i386 and amd64.


Turning test.file into a list of words with tr Unix utility is simple:
Code:
# cat test.file | tr ' ' '\n'
FreeBSD
7.2-RELEASE
is
now
available
for
the
amd64,
...


But this does not solve the problem with non-alpha characters.

This is how to transform the text contained in a file into a list of words removing non-alpha characters
Code:
# cat test.file| tr -cs "[:alpha:]" "\n"
FreeBSD
RELEASE
is
now
available
for
the
amd
...


This is how to transform the text contained in a file into a list of words removing only punctuation marks
Code:
# cat test.file | tr -d '[:punct:]' | tr ' ' '\n'
FreeBSD
72RELEASE
is
now
available
for
the
amd64
...


Post here your questions/opinions.


Top
Display posts from previous:  Sort by  
Print view

Topics related to - "How to convert a text into a list of words, one per line"
 Topics   Author   Replies   Views   Last post 
There are no new unread posts for this topic. How to get the word frequency in a text

LaR3

0

1049

Wed Aug 05, 2009 2:01 pm

LaR3 View the latest post

 

Who is online
Users browsing this forum: No registered users and 0 guests
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum
Jump to:  
News News Site map Site map SitemapIndex SitemapIndex RSS Feed RSS Feed Channel list Channel list


Delete all board cookies | The team | All times are UTC + 2 hours [ DST ]

Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group
DAJ Glass 2 template created by Dustin Baccetti

phpBB SEO