Finding and replacing HTML entities used in resource #3 WebsUni (Webster's Words*) |
|
Command: paste <(grep -o "&[^&;]*;" WebsWords|sort|uniq -c) <(for i in `grep -o "&[^&;]*;" WebsWords|sort|uniq`; do echo $i `echo $i|./UniTrans`; done)|awk '{print $1, $2, $4}' Output: |
|
7 á á 6 â â 528 æ æ 5 à à 1 ã ã 8 ä ä 14 ç ç 232 é é 18 ê ê 41 è è 174 ë ë |
1 î î 14 ï ï 15 ñ ñ 1 ó ó 7 ô ô 179 œ œ 152 ö ö 3 û û 1 ù ù 12 ü ü |
lrwxrwxrwx. 1 ddailey ddailey 54 Jan 5 2017 WebsWords -> ../public_html/data/wordstudy/webster1913/WEBSTERwords
The program UniTrans is merely a chain of sed substitution commands: