ruby.git - The Ruby Programming Language

Age	Commit message (Collapse)	Author
22 hours	[ruby/strscan] [DOC] Doc for StringScanner#scan_integer	Burdette Lamar
	(https://github.com/ruby/strscan/pull/206) Came here to fix a broken link (second pattern generated a bogus link); stayed to add examples and generally brighten. https://github.com/ruby/strscan/commit/32cc23934c
2026-05-03	[ruby/strscan] Fix `call-seq` formatting	Earlopain
	(https://github.com/ruby/strscan/pull/203) They don't work in included markdown and also need a leading `:` <img width="791" height="442" alt="grafik" src="https://github.com/user-attachments/assets/eca5d4c2-9597-4c58-a0cc-a4e2c9109992" /> https://github.com/ruby/strscan/commit/b504b3b0ad
2026-04-17	Fixed the wrong dev version of strscan	Hiroshi SHIBATA

2026-03-31	[ruby/strscan] Implement compaction and embedded structs	Jean Boussier
	(https://github.com/ruby/strscan/pull/201) GC compaction was introduced in Ruby 2.7. Embedded Structs was introduced in Ruby 3.3. When enabled, the `struct strscanner` is stored directly inside the object slot, meaning reading the struct member doesn't require loading another memory region. https://github.com/ruby/strscan/commit/a018cbe40e
2026-03-29	Exclude StringScanner implementation for TruffleRuby	Nobuyoshi Nakada

2026-03-29	[ruby/strscan] [DOC] Undoc unused constants	Nobuyoshi Nakada
	https://github.com/ruby/strscan/commit/1d85632f2a
2026-03-28	[ruby/strscan] Implement StringScanner for TruffleRuby in pure Ruby	Benoit Daloze
	(https://github.com/ruby/strscan/pull/195) * Fixes https://github.com/ruby/strscan/issues/194. * This is a fresh new implementation from scratch, contributed directly to ruby/strscan, under BSD-2-Clause. This was implemented using the strscan tests, specs from ruby/spec and the documentation (https://docs.ruby-lang.org/en/master/StringScanner.html). * lib/strscan.rb now handles the loading for all implementations. * Test on TruffleRuby 33 to ensure it keeps working on older TruffleRuby releases, which do not have the necessary Primitives used by this new implementation. --------- https://github.com/ruby/strscan/commit/3908e0ddb0 Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2026-03-15	[ruby/strscan] Do not use C99 style comment [ci skip]	Nobuyoshi Nakada
	The minimum required ruby version of this library is still 2.4. C99 has been adopted since ruby 2.7. https://github.com/ruby/strscan/commit/c50abc95ad
2025-12-26	Mark development version for unreleased gems	Hiroshi SHIBATA

2025-12-26	[ruby/strscan] Bump version	Sutou Kouhei
	https://github.com/ruby/strscan/commit/747a3b5def
2025-12-17	Bundle strscan-3.1.6	Hiroshi SHIBATA

2025-11-05	[ruby/strscan] [DOC] no doc for internal methods	Nobuyoshi Nakada
	https://github.com/ruby/strscan/commit/5614095d9c
2025-11-05	[ruby/strscan] Deprecate constant `Id`	Nobuyoshi Nakada
	`$Id$` is for RCS, CVS, and SVN; no information with GIT. https://github.com/ruby/strscan/commit/9e3db14fa2
2025-11-05	[ruby/strscan] [DOC] Add document of StringScanner::Error	Nobuyoshi Nakada
	https://github.com/ruby/strscan/commit/16ec901356
2025-11-05	[ruby/strscan] Deprecate undocumented toplevel constant `ScanError`	Nobuyoshi Nakada
	https://github.com/ruby/strscan/commit/b4ddc3a2a6
2025-11-05	[ruby/strscan] ISO C90 forbids mixed declarations and code	Nobuyoshi Nakada
	Cannot use C99 syntax, as far as supporting Ruby 2.6 and earlier. https://github.com/ruby/strscan/commit/f6d178fda5
2025-11-05	[ruby/strscan] [DOC] Remove the statement `rest?` is obsolete	Nobuyoshi Nakada
	`eos?` is opposite, cannot be used instead of `rest?`. https://github.com/ruby/strscan/commit/bee8cc547b
2025-11-04	[ruby/strscan] Resurrect a method that has not been obsolete	Takashi Kokubun
	(https://github.com/ruby/strscan/pull/169) Partially revert https://github.com/ruby/strscan/pull/168 because strscan_rest_p did not have `rb_warning("StringScanner#rest? is obsolete")`. It is actively used by the latest tzinfo.gem, and we shouldn't remove it without deprecating it. https://github.com/ruby/strscan/commit/f3fdf21189
2025-11-04	[ruby/strscan] Remove methods have been obsolete over two decades	Nobuyoshi Nakada
	https://github.com/ruby/strscan/commit/1387def685
2025-11-04	[ruby/strscan] Remove no longer used variable	Nobuyoshi Nakada
	Since https://github.com/ruby/strscan/commit/92961cde2b42. https://github.com/ruby/strscan/commit/911f9c682a
2025-07-11	Update dependencies for addition of set.h to public headers	Jeremy Evans

2025-07-01	[ruby/strscan] Run `have_func` with the header providing the declarations	Nobuyoshi Nakada
	https://github.com/ruby/strscan/commit/18c0a59b65
2025-06-12	[ruby/strscan] Update extconf.rb	Nobuyoshi Nakada
	(https://github.com/ruby/strscan/pull/158) - `have_func` includes "ruby.h" by default. - include "ruby/re.h" where `rb_reg_onig_match` is declared. https://github.com/ruby/strscan/commit/1ac96f47e9
2025-06-06	Bump up strscan version to 3.1.6.dev	Hiroshi SHIBATA

2025-06-06	[ruby/strscan] Implement Write Barrier	Daniel Colson
	(https://github.com/ruby/strscan/pull/156) StringScanner holds the string being scanned, and a regex for methods like `match?`. Triggering the write barrier for those allows us to mark this as WB protected. https://github.com/ruby/strscan/commit/32fec70407
2025-05-02	Bump up strscan version to 3.1.5.dev	Hiroshi SHIBATA

2025-05-02	[ruby/strscan] named_captures: fix incompatibility with	Sutou Kouhei
	MatchData#named_captures (https://github.com/ruby/strscan/pull/146) Fix https://github.com/ruby/strscan/pull/145 `MatchData#named_captures` use the last matched value for each name. Reported by Linus Sellberg. Thanks!!! https://github.com/ruby/strscan/commit/a6086ea322
2025-04-22	Mark development version for unreleased gems	Hiroshi SHIBATA

2025-04-22	[ruby/strscan] Bump version	Sutou Kouhei
	https://github.com/ruby/strscan/commit/8ff80150c4
2025-04-14	[ruby/strscan] Bump version	Sutou Kouhei
	https://github.com/ruby/strscan/commit/7b1eb1e4ed
2025-04-14	[ruby/strscan] Allow parsing strings larger than 2GiB	Jean byroot Boussier
	(https://github.com/ruby/strscan/pull/147) For a reason unknown, even though `pos` is stored as a `long`, the `#pos` and `#pos=` treat it as an `int`, which prevent seeking into strings larger than 2GiB. https://github.com/ruby/strscan/commit/b76368416e Co-authored-by: Jean Boussier <jean.boussier@gmail.com>
2025-02-25	[ruby/strscan] Fix a bug that inconsistency of IndexError vs nil for	NAITOH Jun
	unknown capture group (https://github.com/ruby/strscan/pull/143) Fix https://github.com/ruby/strscan/pull/139 Reported by Benoit Daloze. Thanks!!! https://github.com/ruby/strscan/commit/bc8a0d2623 Notes: Merged: https://github.com/ruby/ruby/pull/12804
2025-02-25	[ruby/strscan] Fix a bug that scanning methods that don't use Regexp	NAITOH Jun
	don't clear named capture groups (https://github.com/ruby/strscan/pull/142) Fix https://github.com/ruby/strscan/pull/135 https://github.com/ruby/strscan/commit/b957443e20 Notes: Merged: https://github.com/ruby/ruby/pull/12804
2025-02-21	[ruby/strscan] `scan_integer(base: 16)` ignore x suffix if not	Jean Boussier
	followed by hexadecimal (https://github.com/ruby/strscan/pull/141) Fix: https://github.com/ruby/strscan/issues/140 `0x<EOF>`, `0xZZZ` should be parsed as `0` instead of not matching at all. https://github.com/ruby/strscan/commit/c4e4795ed2
2025-02-17	[ruby/strscan] Fix a bug that scan_until behaves differently with	NAITOH Jun
	Regexp and String patterns (https://github.com/ruby/strscan/pull/138) Fix https://github.com/ruby/strscan/pull/131 https://github.com/ruby/strscan/commit/e1cec2e726
2025-02-14	Removed trailing spaces	Hiroshi SHIBATA

2025-02-14	[ruby/strscan] Fix a bug that scan_integer doesn't update matched	Jean Boussier
	data (https://github.com/ruby/strscan/pull/133) Fix https://github.com/ruby/strscan/pull/130 Reported by Andrii Konchyn. Thanks!!! https://github.com/ruby/strscan/commit/4e5f17f87a
2024-12-16	[ruby/strscan] [DOC] Add syntax highlighting to MarkDown code blocks	Alexander Momchilov
	(https://github.com/ruby/strscan/pull/126) Split off from https://github.com/ruby/ruby/pull/12322 https://github.com/ruby/strscan/commit/9bee37e0f5
2024-12-16	[ruby/strscan] Bump version	Sutou Kouhei
	https://github.com/ruby/strscan/commit/fd140b8582
2024-12-12	Lock released version of strscan-3.1.1	Hiroshi SHIBATA

2024-12-02	Removed trailing spaces	Hiroshi SHIBATA

2024-12-02	[ruby/strscan] Micro optimize encoding checks	Jean Boussier
	(https://github.com/ruby/strscan/pull/117) Profiling shows a lot of time spent in various encoding check functions. I'm working on optimizing them on the Ruby side, but if we assume most strings are one of the simple 3 encodings, we can skip a lot of overhead. ```ruby require 'strscan' require 'benchmark/ips' source = 10_000.times.map { rand(9999999).to_s }.join(",").force_encoding(Encoding::UTF_8).freeze def scan_to_i(source) scanner = StringScanner.new(source) while number = scanner.scan(/\d+/) number.to_i scanner.skip(",") end end def scan_integer(source) scanner = StringScanner.new(source) while scanner.scan_integer scanner.skip(",") end end Benchmark.ips do \|x\| x.report("scan.to_i") { scan_to_i(source) } x.report("scan_integer") { scan_integer(source) } x.compare! end ``` Before: ``` ruby 3.3.4 (2024-07-09 revision https://github.com/ruby/strscan/commit/be1089c8ec) +YJIT [arm64-darwin23] Warming up -------------------------------------- scan.to_i 93.000 i/100ms scan_integer 232.000 i/100ms Calculating ------------------------------------- scan.to_i 933.191 (± 0.2%) i/s (1.07 ms/i) - 4.743k in 5.082597s scan_integer 2.326k (± 0.8%) i/s (429.99 μs/i) - 11.832k in 5.087974s Comparison: scan_integer: 2325.6 i/s scan.to_i: 933.2 i/s - 2.49x slower ``` After: ``` ruby 3.3.4 (2024-07-09 revision https://github.com/ruby/strscan/commit/be1089c8ec) +YJIT [arm64-darwin23] Warming up -------------------------------------- scan.to_i 96.000 i/100ms scan_integer 274.000 i/100ms Calculating ------------------------------------- scan.to_i 969.489 (± 0.2%) i/s (1.03 ms/i) - 4.896k in 5.050114s scan_integer 2.756k (± 0.1%) i/s (362.88 μs/i) - 13.974k in 5.070837s Comparison: scan_integer: 2755.8 i/s scan.to_i: 969.5 i/s - 2.84x slower ``` https://github.com/ruby/strscan/commit/c02b1ce684
2024-12-02	StringScanner#scan_integer support base 16 integers (#116)	Jean Boussier
	Followup: https://github.com/ruby/strscan/pull/115 `scan_integer` is now implemented in Ruby as to efficiently handle keyword arguments without allocating a Hash. Given the goal of `scan_integer` is to more effciently parse integers without having to allocate an intermediary object, using `rb_scan_args` would defeat the purpose. Additionally, the C implementation now uses `rb_isdigit` and `rb_isxdigit`, because on Windows `isdigit` is locale dependent.
2024-11-27	[ruby/strscan] Implement #scan_integer to efficiently parse Integer	Jean Boussier
	(https://github.com/ruby/strscan/pull/115) Fix: https://github.com/ruby/strscan/issues/113 This allows to directly parse an Integer from a String without needing to first allocate a sub string. Notes: The implementation is limited by design, it's meant as a first step, only the most straightforward, based 10 integers are supported. https://github.com/ruby/strscan/commit/6a3c74b4c8
2024-10-26	[ruby/strscan] [CRuby] Optimize `strscan_do_scan()`: Remove	NAITOH Jun
	unnecessary use of `rb_enc_get()` (https://github.com/ruby/strscan/pull/108) - before: #106 ## Why? In `rb_strseq_index()`, the result of `rb_enc_check()` is used. - https://github.com/ruby/ruby/blob/6c7209cd3788ceec01e504d99057f9d3b396be84/string.c#L4335-L4368 > enc = rb_enc_check(str, sub); > return strseq_core(str_ptr, str_ptr_end, str_len, sub_ptr, sub_len, offset, enc); - https://github.com/ruby/ruby/blob/6c7209cd3788ceec01e504d99057f9d3b396be84/string.c#L4309-L4318 ```C strseq_core(const char str_ptr, const char str_ptr_end, long str_len, const char sub_ptr, long sub_len, long offset, rb_encoding enc) { const char search_start = str_ptr; long pos, search_len = str_len - offset; for (;;) { const char t; pos = rb_memsearch(sub_ptr, sub_len, search_start, search_len, enc); ``` ## Benchmark It shows String as a pattern is 1.24x faster than Regexp as a pattern. ``` $ benchmark-driver benchmark/check_until.yaml Warming up -------------------------------------- regexp 9.225M i/s - 9.328M times in 1.011068s (108.40ns/i) regexp_var 9.327M i/s - 9.413M times in 1.009214s (107.21ns/i) string 9.200M i/s - 9.355M times in 1.016840s (108.70ns/i) string_var 11.249M i/s - 11.255M times in 1.000578s (88.90ns/i) Calculating ------------------------------------- regexp 9.565M i/s - 27.676M times in 2.893476s (104.55ns/i) regexp_var 10.111M i/s - 27.982M times in 2.767496s (98.90ns/i) string 10.060M i/s - 27.600M times in 2.743465s (99.40ns/i) string_var 12.519M i/s - 33.746M times in 2.695615s (79.88ns/i) Comparison: string_var: 12518707.2 i/s regexp_var: 10111089.6 i/s - 1.24x slower string: 10060144.4 i/s - 1.24x slower regexp: 9565124.4 i/s - 1.31x slower ``` https://github.com/ruby/strscan/commit/ff2d7afa19
2024-10-26	[ruby/strscan] Use C90 as far as supporting 2.6 or earlier	Nobuyoshi Nakada
	(https://github.com/ruby/strscan/pull/101) https://github.com/ruby/strscan/commit/d31274f41b
2024-09-17	[ruby/strscan] Accept String as a pattern at non head	NAITOH Jun
	(https://github.com/ruby/strscan/pull/106) It supports non-head match cases such as StringScanner#scan_until. If we use a String as a pattern, we can improve match performance. Here is a result of the including benchmark. ## CRuby It shows String as a pattern is 1.18x faster than Regexp as a pattern. ``` $ benchmark-driver benchmark/check_until.yaml Warming up -------------------------------------- regexp 9.403M i/s - 9.548M times in 1.015459s (106.35ns/i) regexp_var 9.162M i/s - 9.248M times in 1.009479s (109.15ns/i) string 8.966M i/s - 9.274M times in 1.034343s (111.54ns/i) string_var 11.051M i/s - 11.190M times in 1.012538s (90.49ns/i) Calculating ------------------------------------- regexp 10.319M i/s - 28.209M times in 2.733707s (96.91ns/i) regexp_var 10.032M i/s - 27.485M times in 2.739807s (99.68ns/i) string 9.681M i/s - 26.897M times in 2.778397s (103.30ns/i) string_var 12.162M i/s - 33.154M times in 2.726046s (82.22ns/i) Comparison: string_var: 12161920.6 i/s regexp: 10318949.7 i/s - 1.18x slower regexp_var: 10031617.6 i/s - 1.21x slower string: 9680843.7 i/s - 1.26x slower ``` ## JRuby It shows String as a pattern is 2.11x faster than Regexp as a pattern. ``` $ benchmark-driver benchmark/check_until.yaml Warming up -------------------------------------- regexp 7.591M i/s - 7.544M times in 0.993780s (131.74ns/i) regexp_var 6.143M i/s - 6.125M times in 0.997038s (162.77ns/i) string 14.135M i/s - 14.079M times in 0.996067s (70.75ns/i) string_var 14.079M i/s - 14.057M times in 0.998420s (71.03ns/i) Calculating ------------------------------------- regexp 9.409M i/s - 22.773M times in 2.420268s (106.28ns/i) regexp_var 10.116M i/s - 18.430M times in 1.821820s (98.85ns/i) string 21.389M i/s - 42.404M times in 1.982519s (46.75ns/i) string_var 20.897M i/s - 42.237M times in 2.021187s (47.85ns/i) Comparison: string: 21389191.1 i/s string_var: 20897327.5 i/s - 1.02x slower regexp_var: 10116464.7 i/s - 2.11x slower regexp: 9409222.3 i/s - 2.27x slower ``` See: https://github.com/jruby/jruby/blob/be7815ec02356a58891c8727bb448f0c6a826d96/core/src/main/java/org/jruby/util/StringSupport.java#L1706-L1736 --------- https://github.com/ruby/strscan/commit/f9d96c446a Co-authored-by: Sutou Kouhei <kou@clear-code.com>
2024-08-31	Added pre-release suffix for development version of default gems	Hiroshi SHIBATA
	https://github.com/ruby/stringio/issues/81
2024-06-04	Sync strscan HEAD again.	Hiroshi SHIBATA
	https://github.com/ruby/strscan/pull/99 split document with multi-byte chars.
2024-05-30	Revert "[ruby/strscan] Doc for StringScanner"	Hiroshi SHIBATA
	This reverts commit 974ed1408c516d1e8f992f0b304e2de6f8bd5c1f.