Remove Filters
Remove Blanks From Start Of Line
Removes spaces and tabs from the start of every line. This is equivalant to an unindent action.
Remove Blanks From End Of Line
Removes spaces and tabs from the end of every line. In general, these characters are never used and can be removed with no ill-effects.
Remove blank lines
Removes blank lines. Note, lines with spaces or tabs are not removed. Use the Remove Blanks From Start Of Line filters first to rectify this.
Remove multiple whitespace
Removes sequences of multiple spaces or tabs and replaces them with a single space. Useful for making documents look professional – multiple spaces look untidy.
Remove HTML and SGML
Use this filter to convert HTML documents to a readable format.
This filter removes HTML and XML markup tags i.e. everything including and between <> brackets.
If your file still has HTML special characters in it such as
®
etc, follow this filter with the included filter "html\HTML to text.fll" filter. This filter will replace the escaped special characters e.g. ® with (tm).
Remove backspaces
Remove backspaces ie all ASCII code 8s.
Remove matching/non-matching lines
TextPipe will remove any lines that match/don't match the given pattern. For more options, please see the Restrict to matching lines filter. Unlike that filter, this filter is not allowed to have any sub filters.
Remove lines from list
TextPipe will remove all lines that EXACTLY match any of the lines in the given text file, or in the text memo area (line breaks can be of any format).
Remove line range
Several different means of specifying the lines to remove are provided:
- Start line, End line. This removes all lines including and between the specified lines.
- Start line, width. Removes Width lines starting from (and including) line Start.
- End line, width. Removes Width lines backwards starting from (and including) line End.
- Start line to end of line. Removes all lines from the Start line to the very end of the line.
- Width to end of line. Removes Width lines backwards starting from (and including) the last line.
Remove lines matching pattern list
TextPipe will remove all lines that match any of the patterns in the given text file (or specified in the text memo area). Each pattern must be specified on a line of its own (line breaks can be of any format). Each line is interpreted as a perl-style pattern, with options of case-insensitive matching, non-greedy matching, comments not allowed. If you need to change these options, precede each line by the following codes:
| Change option command |
Result |
| (?-i) |
Case-sensitive matching |
| (?x) |
Comments allowed |
| (?U) |
Greedy matching |
Remove duplicate lines
Removes successive duplicate lines. The duplicate lines must be adjacent for this filter to remove them. This filter is most commonly used after a sort - where duplicate lines will be placed together. From a set of duplicate lines, the line that gets output is the last duplicate line of the set, unless the set starts on the first line of the file, in which case the first line gets output.
- Ignore case
If ignore case is checked, lines do not need to be cased identically to be considered duplicates. Two identical lines, one in upper case, and one in lower case, would be considered duplicates and removed by this filter. If ignore case is unchecked, the lines must be identical to be considered duplicates. The case checking routines are ANSI aware, so their behaviour may change depending on your locale.
- Start column
The Start Column field allows you to ignore leading characters before a comparison is made. This can be very useful for ignoring line numbers is source code etc. It can also be used (in conjunction with an external sort program) to remove duplicates from an unsorted file. First add line numbers to every line. Then use an external sort program on the file, ignoring the line numbers (the first 6 columns). Then use this filter (with Ignore Leading Characters set to 6) to remove duplicate lines. Finally, remove the line numbers using the Remove Columns (remove first 6 columns) filter.
- Length
The comparison can also ignore trailing characters if desired, by setting the length to less than the length of the line.
Remove ANSI codes
ANSI (American National Standards Institute) codes are included in various streams of information (most commonly BBS connections), to provide a remote computer with control over cursor positioning, text attributes, etc. They are also used in connections between minicomputers and mainframe computers and the terminals connected to them.
The need to use an ANSI filter can be recognized when something like the following example shows up in a file viewed in a text editor:
<[0;1;4mas<[m - MC88000 assembler
In this example the "as" near the beginning is displayed in a different color than the rest of the line when the ANSI codes are properly processed. The Escape (ASCII 27) codes above have been replaced by the < symbol to make this line printable.
The Remove ANSI Escape Sequences filter can be used to filter out these codes and "clean up" the text so that it can be used in standard fashions such as copying and pasting into a word processor. On Unix machines the man (manual) help utility will only allow page-by-page browsing through a file in a forward direction. By piping the man output to a text file, transferring it to a DOS machine, and running it through the Remove ANSI Escape Sequences filter (and the Convert EOL filter - Unix to DOS if desired), a standard DOS editor can be used for browsing through the file, quoting from it, etc.
Remove email headers
This filter removes the email headers that accompany emails exported to a text format. The email headers are the lines such as To:, From:, Subject: and various other message headers added by all the servers through which your email passes before it gets to its destination.
|