summaryrefslogtreecommitdiff
path: root/doc/character_selector.rdoc
blob: 9bc477ea71f5ee2cd2a0b922be441a9cb00f6a34 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
== Character Selectors

A _character_ _selector_ is a string argument accepted by certain Ruby methods.
Each of these instance methods accepts one or more character selectors:

- String#tr(selector, replacements): returns a new string.
- String#tr!(selector, replacements): returns +self+.
- String#tr_s(selector, replacements): returns a new string.
- String#tr_s!(selector, replacements): returns +self+.
- String#delete(*selectors): returns a new string.
- String#delete!(*selectors): returns +self+.
- String#count(*selectors): counts specified characters.

A character selector identifies zero or more characters in +self+
that are to be operands for the method.

In this section, we illustrate using method String#delete(selector),
which deletes the selected characters.

In the simplest case, the characters selected are exactly those
contained in the selector itself:

  'abracadabra'.delete('a')   # => "brcdbr"
  'abracadabra'.delete('ab')  # => "rcdr"
  'abracadabra'.delete('abc') # => "rdr"
  '0123456789'.delete('258')  # => "0134679"
  '!@#$%&*()_+'.delete('+&#') # => "!@$%*()_"
  'тест'.delete('т')          # => "ес"
  'こんにちは'.delete('に')     # => "こんちは"

Note that order and repetitions do not matter:

  'abracadabra'.delete('dcab') # => "rr"
  'abracadabra'.delete('aaaa') # => "brcdbr"

In a character selector, these three characters get special treatment:

- A leading caret (<tt>'^'</tt>') functions as a "not" operator
  for the characters to its right:

    'abracadabra'.delete('^bc') # => "bcb"
    '0123456789'.delete('^852') # => "258"

- A hyphen (<tt>'-'</tt>) between two other characters
  defines a range of characters instead of a plain string of characters:

    'abracadabra'.delete('a-d') # => "rr"
    '0123456789'.delete('4-7')  # => "012389"
    '!@#$%&*()_+'.delete(' -/') # => "@^_"

    # May contain more than one range.
    'abracadabra'.delete('a-cq-t') # => "d"

    # Ranges may be mixed with plain characters.
    '0123456789'.delete('67-950-23') # => "4"

    # Ranges may be mixed with negations.
    'abracadabra'.delete('^a-c') # => "abacaaba"

- A backslash (<tt>'\'</tt>) acts as an escape for a caret, a hyphen,
  or another backslash:

    'abracadabra^'.delete('\^bc')   # => "araadara"
    'abracadabra-'.delete('a\-d')   # => "brcbr"
    "hello\r\nworld".delete("\r")   # => "hello\nworld"
    "hello\r\nworld".delete("\\r")  # => "hello\r\nwold"
    "hello\r\nworld".delete("\\\r") # => "hello\nworld"