Annotation of src/usr.bin/tr/tr.1, Revision 1.1
1.1 ! deraadt 1: .\" $NetBSD: tr.1,v 1.5 1994/12/07 08:35:13 jtc Exp $
! 2: .\"
! 3: .\" Copyright (c) 1991, 1993
! 4: .\" The Regents of the University of California. All rights reserved.
! 5: .\"
! 6: .\" This code is derived from software contributed to Berkeley by
! 7: .\" the Institute of Electrical and Electronics Engineers, Inc.
! 8: .\"
! 9: .\" Redistribution and use in source and binary forms, with or without
! 10: .\" modification, are permitted provided that the following conditions
! 11: .\" are met:
! 12: .\" 1. Redistributions of source code must retain the above copyright
! 13: .\" notice, this list of conditions and the following disclaimer.
! 14: .\" 2. Redistributions in binary form must reproduce the above copyright
! 15: .\" notice, this list of conditions and the following disclaimer in the
! 16: .\" documentation and/or other materials provided with the distribution.
! 17: .\" 3. All advertising materials mentioning features or use of this software
! 18: .\" must display the following acknowledgement:
! 19: .\" This product includes software developed by the University of
! 20: .\" California, Berkeley and its contributors.
! 21: .\" 4. Neither the name of the University nor the names of its contributors
! 22: .\" may be used to endorse or promote products derived from this software
! 23: .\" without specific prior written permission.
! 24: .\"
! 25: .\" THIS SOFTWARE IS PROVIDED BY THE REGENTS AND CONTRIBUTORS ``AS IS'' AND
! 26: .\" ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
! 27: .\" IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
! 28: .\" ARE DISCLAIMED. IN NO EVENT SHALL THE REGENTS OR CONTRIBUTORS BE LIABLE
! 29: .\" FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
! 30: .\" DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
! 31: .\" OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
! 32: .\" HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
! 33: .\" LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
! 34: .\" OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
! 35: .\" SUCH DAMAGE.
! 36: .\"
! 37: .\" @(#)tr.1 8.1 (Berkeley) 6/6/93
! 38: .\"
! 39: .Dd June 6, 1993
! 40: .Dt TR 1
! 41: .Os
! 42: .Sh NAME
! 43: .Nm tr
! 44: .Nd translate characters
! 45: .Sh SYNOPSIS
! 46: .Nm tr
! 47: .Op Fl cs
! 48: .Ar string1 string2
! 49: .Nm tr
! 50: .Op Fl c
! 51: .Fl d
! 52: .Ar string1
! 53: .Nm tr
! 54: .Op Fl c
! 55: .Fl s
! 56: .Ar string1
! 57: .Nm tr
! 58: .Op Fl c
! 59: .Fl ds
! 60: .Ar string1 string2
! 61: .Sh DESCRIPTION
! 62: The
! 63: .Nm tr
! 64: utility copies the standard input to the standard output with substitution
! 65: or deletion of selected characters.
! 66: .Pp
! 67: The following options are available:
! 68: .Bl -tag -width Ds
! 69: .It Fl c
! 70: Complements the set of characters in
! 71: .Ar string1 ,
! 72: that is ``-c ab'' includes every character except for ``a'' and ``b''.
! 73: .It Fl d
! 74: The
! 75: .Fl d
! 76: option causes characters to be deleted from the input.
! 77: .It Fl s
! 78: The
! 79: .Fl s
! 80: option squeezes multiple occurrences of the characters listed in the last
! 81: operand (either
! 82: .Ar string1
! 83: or
! 84: .Ar string2 )
! 85: in the input into a single instance of the character.
! 86: This occurs after all deletion and translation is completed.
! 87: .El
! 88: .Pp
! 89: In the first synopsis form, the characters in
! 90: .Ar string1
! 91: are translated into the characters in
! 92: .Ar string2
! 93: where the first character in
! 94: .Ar string1
! 95: is translated into the first character in
! 96: .Ar string2
! 97: and so on.
! 98: If
! 99: .Ar string1
! 100: is longer than
! 101: .Ar string2 ,
! 102: the last character found in
! 103: .Ar string2
! 104: is duplicated until
! 105: .Ar string1
! 106: is exhausted.
! 107: .Pp
! 108: In the second synopsis form, the characters in
! 109: .Ar string1
! 110: are deleted from the input.
! 111: .Pp
! 112: In the third synopsis form, the characters in
! 113: .Ar string1
! 114: are compressed as described for the
! 115: .Fl s
! 116: option.
! 117: .Pp
! 118: In the fourth synopsis form, the characters in
! 119: .Ar string1
! 120: are deleted from the input, and the characters in
! 121: .Ar string2
! 122: are compressed as described for the
! 123: .Fl s
! 124: option.
! 125: .Pp
! 126: The following conventions can be used in
! 127: .Ar string1
! 128: and
! 129: .Ar string2
! 130: to specify sets of characters:
! 131: .Bl -tag -width [:equiv:]
! 132: .It character
! 133: Any character not described by one of the following conventions
! 134: represents itself.
! 135: .It \eoctal
! 136: A backslash followed by 1, 2 or 3 octal digits represents a character
! 137: with that encoded value.
! 138: To follow an octal sequence with a digit as a character, left zero-pad
! 139: the octal sequence to the full 3 octal digits.
! 140: .It \echaracter
! 141: A backslash followed by certain special characters maps to special
! 142: values.
! 143: .sp
! 144: .Bl -column
! 145: .It \ea <alert character>
! 146: .It \eb <backspace>
! 147: .It \ef <form-feed>
! 148: .It \en <newline>
! 149: .It \er <carriage return>
! 150: .It \et <tab>
! 151: .It \ev <vertical tab>
! 152: .El
! 153: .sp
! 154: A backslash followed by any other character maps to that character.
! 155: .It c-c
! 156: Represents the range of characters between the range endpoints, inclusively.
! 157: .It [:class:]
! 158: Represents all characters belonging to the defined character class.
! 159: Class names are:
! 160: .sp
! 161: .Bl -column
! 162: .It alnum <alphanumeric characters>
! 163: .It alpha <alphabetic characters>
! 164: .It blank <blank characters>
! 165: .It cntrl <control characters>
! 166: .It digit <numeric characters>
! 167: .It graph <graphic characters>
! 168: .It lower <lower-case alphabetic characters>
! 169: .It print <printable characters>
! 170: .It punct <punctuation characters>
! 171: .It space <space characters>
! 172: .It upper <upper-case characters>
! 173: .It xdigit <hexadecimal characters>
! 174: .El
! 175: .Pp
! 176: \." All classes may be used in
! 177: \." .Ar string1 ,
! 178: \." and in
! 179: \." .Ar string2
! 180: \." when both the
! 181: \." .Fl d
! 182: \." and
! 183: \." .Fl s
! 184: \." options are specified.
! 185: \." Otherwise, only the classes ``upper'' and ``lower'' may be used in
! 186: \." .Ar string2
! 187: \." and then only when the corresponding class (``upper'' for ``lower''
! 188: \." and vice-versa) is specified in the same relative position in
! 189: \." .Ar string1 .
! 190: \." .Pp
! 191: With the exception of the ``upper'' and ``lower'' classes, characters
! 192: in the classes are in unspecified order.
! 193: In the ``upper'' and ``lower'' classes, characters are entered in
! 194: ascending order.
! 195: .Pp
! 196: For specific information as to which ASCII characters are included
! 197: in these classes, see
! 198: .Xr ctype 3
! 199: and related manual pages.
! 200: .It [=equiv=]
! 201: Represents all characters or collating (sorting) elements belonging to
! 202: the same equivalence class as
! 203: .Ar equiv .
! 204: If
! 205: there is a secondary ordering within the equivalence class, the characters
! 206: are ordered in ascending sequence.
! 207: Otherwise, they are ordered after their encoded values.
! 208: An example of an equivalence class might be ``c'' and ``ch'' in Spanish;
! 209: English has no equivalence classes.
! 210: .It [#*n]
! 211: Represents
! 212: .Ar n
! 213: repeated occurrences of the character represented by
! 214: .Ar # .
! 215: This
! 216: expression is only valid when it occurs in
! 217: .Ar string2 .
! 218: If
! 219: .Ar n
! 220: is omitted or is zero, it is be interpreted as large enough to extend
! 221: .Ar string2
! 222: sequence to the length of
! 223: .Ar string1 .
! 224: If
! 225: .Ar n
! 226: has a leading zero, it is interpreted as an octal value, otherwise,
! 227: it's interpreted as a decimal value.
! 228: .El
! 229: .Pp
! 230: The
! 231: .Nm tr
! 232: utility exits 0 on success, and >0 if an error occurs.
! 233: .Sh EXAMPLES
! 234: The following examples are shown as given to the shell:
! 235: .sp
! 236: Create a list of the words in file1, one per line, where a word is taken to
! 237: be a maximal string of letters.
! 238: .sp
! 239: .D1 Li "tr -cs \*q[:alpha:]\*q \*q\en\*q < file1"
! 240: .sp
! 241: Translate the contents of file1 to upper-case.
! 242: .sp
! 243: .D1 Li "tr \*q[:lower:]\*q \*q[:upper:]\*q < file1"
! 244: .sp
! 245: Strip out non-printable characters from file1.
! 246: .sp
! 247: .D1 Li "tr -cd \*q[:print:]\*q < file1"
! 248: .Sh COMPATIBILITY
! 249: System V has historically implemented character ranges using the syntax
! 250: ``[c-c]'' instead of the ``c-c'' used by historic BSD implementations and
! 251: standardized by POSIX.
! 252: System V shell scripts should work under this implementation as long as
! 253: the range is intended to map in another range, i.e. the command
! 254: ``tr [a-z] [A-Z]'' will work as it will map the ``['' character in
! 255: .Ar string1
! 256: to the ``['' character in
! 257: .Ar string2.
! 258: However, if the shell script is deleting or squeezing characters as in
! 259: the command ``tr -d [a-z]'', the characters ``['' and ``]'' will be
! 260: included in the deletion or compression list which would not have happened
! 261: under an historic System V implementation.
! 262: Additionally, any scripts that depended on the sequence ``a-z'' to
! 263: represent the three characters ``a'', ``-'' and ``z'' will have to be
! 264: rewritten as ``a\e-z''.
! 265: .Pp
! 266: The
! 267: .Nm tr
! 268: utility has historically not permitted the manipulation of NUL bytes in
! 269: its input and, additionally, stripped NUL's from its input stream.
! 270: This implementation has removed this behavior as a bug.
! 271: .Pp
! 272: The
! 273: .Nm tr
! 274: utility has historically been extremely forgiving of syntax errors,
! 275: for example, the
! 276: .Fl c
! 277: and
! 278: .Fl s
! 279: options were ignored unless two strings were specified.
! 280: This implementation will not permit illegal syntax.
! 281: .Sh STANDARDS
! 282: The
! 283: .Nm tr
! 284: utility is expected to be
! 285: .St -p1003.2
! 286: compatible.
! 287: It should be noted that the feature wherein the last character of
! 288: .Ar string2
! 289: is duplicated if
! 290: .Ar string2
! 291: has less characters than
! 292: .Ar string1
! 293: is permitted by POSIX but is not required.
! 294: Shell scripts attempting to be portable to other POSIX systems should use
! 295: the ``[#*]'' convention instead of relying on this behavior.