Skip to content

Commit 259863d

Browse files
authored
doc: use HTTPS in links
PR #726
1 parent 2bab987 commit 259863d

File tree

6 files changed

+26
-26
lines changed

6 files changed

+26
-26
lines changed

README.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -245,12 +245,12 @@ supported version of Rust.
245245
This project is licensed under either of
246246

247247
* Apache License, Version 2.0, ([LICENSE-APACHE](LICENSE-APACHE) or
248-
http://www.apache.org/licenses/LICENSE-2.0)
248+
https://www.apache.org/licenses/LICENSE-2.0)
249249
* MIT license ([LICENSE-MIT](LICENSE-MIT) or
250-
http://opensource.org/licenses/MIT)
250+
https://opensource.org/licenses/MIT)
251251

252252
at your option.
253253

254254
The data in `regex-syntax/src/unicode_tables/` is licensed under the Unicode
255255
License Agreement
256-
([LICENSE-UNICODE](http://www.unicode.org/copyright.html#License)).
256+
([LICENSE-UNICODE](https://www.unicode.org/copyright.html#License)).

UNICODE.md

Lines changed: 17 additions & 17 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
# Unicode conformance
22

33
This document describes the regex crate's conformance to Unicode's
4-
[UTS#18](http://unicode.org/reports/tr18/)
4+
[UTS#18](https://unicode.org/reports/tr18/)
55
report, which lays out 3 levels of support: Basic, Extended and Tailored.
66

77
Full support for Level 1 ("Basic Unicode Support") is provided with two
@@ -10,7 +10,7 @@ exceptions:
1010
1. Line boundaries are not Unicode aware. Namely, only the `\n`
1111
(`END OF LINE`) character is recognized as a line boundary.
1212
2. The compatibility properties specified by
13-
[RL1.2a](http://unicode.org/reports/tr18/#RL1.2a)
13+
[RL1.2a](https://unicode.org/reports/tr18/#RL1.2a)
1414
are ASCII-only definitions.
1515

1616
Little to no support is provided for either Level 2 or Level 3. For the most
@@ -61,18 +61,18 @@ provide a convenient way to construct character classes of groups of code
6161
points specified by Unicode. The regex crate does not provide exhaustive
6262
support, but covers a useful subset. In particular:
6363

64-
* [General categories](http://unicode.org/reports/tr18/#General_Category_Property)
65-
* [Scripts and Script Extensions](http://unicode.org/reports/tr18/#Script_Property)
66-
* [Age](http://unicode.org/reports/tr18/#Age)
64+
* [General categories](https://unicode.org/reports/tr18/#General_Category_Property)
65+
* [Scripts and Script Extensions](https://unicode.org/reports/tr18/#Script_Property)
66+
* [Age](https://unicode.org/reports/tr18/#Age)
6767
* A smattering of boolean properties, including all of those specified by
68-
[RL1.2](http://unicode.org/reports/tr18/#RL1.2) explicitly.
68+
[RL1.2](https://unicode.org/reports/tr18/#RL1.2) explicitly.
6969

7070
In all cases, property name and value abbreviations are supported, and all
7171
names/values are matched loosely without regard for case, whitespace or
7272
underscores. Property name aliases can be found in Unicode's
73-
[`PropertyAliases.txt`](http://www.unicode.org/Public/UCD/latest/ucd/PropertyAliases.txt)
73+
[`PropertyAliases.txt`](https://www.unicode.org/Public/UCD/latest/ucd/PropertyAliases.txt)
7474
file, while property value aliases can be found in Unicode's
75-
[`PropertyValueAliases.txt`](http://www.unicode.org/Public/UCD/latest/ucd/PropertyValueAliases.txt)
75+
[`PropertyValueAliases.txt`](https://www.unicode.org/Public/UCD/latest/ucd/PropertyValueAliases.txt)
7676
file.
7777

7878
The syntax supported is also consistent with the UTS#18 recommendation:
@@ -149,10 +149,10 @@ properties correspond to properties required by RL1.2):
149149

150150
## RL1.2a Compatibility Properties
151151

152-
[UTS#18 RL1.2a](http://unicode.org/reports/tr18/#RL1.2a)
152+
[UTS#18 RL1.2a](https://unicode.org/reports/tr18/#RL1.2a)
153153

154154
The regex crate only provides ASCII definitions of the
155-
[compatibility properties documented in UTS#18 Annex C](http://unicode.org/reports/tr18/#Compatibility_Properties)
155+
[compatibility properties documented in UTS#18 Annex C](https://unicode.org/reports/tr18/#Compatibility_Properties)
156156
(sans the `\X` class, for matching grapheme clusters, which isn't provided
157157
at all). This is because it seems to be consistent with most other regular
158158
expression engines, and in particular, because these are often referred to as
@@ -165,7 +165,7 @@ Their traditional ASCII definition can be used by disabling Unicode. That is,
165165

166166
## RL1.3 Subtraction and Intersection
167167

168-
[UTS#18 RL1.3](http://unicode.org/reports/tr18/#Subtraction_and_Intersection)
168+
[UTS#18 RL1.3](https://unicode.org/reports/tr18/#Subtraction_and_Intersection)
169169

170170
The regex crate provides full support for nested character classes, along with
171171
union, intersection (`&&`), difference (`--`) and symmetric difference (`~~`)
@@ -178,7 +178,7 @@ For example, to match all non-ASCII letters, you could use either
178178

179179
## RL1.4 Simple Word Boundaries
180180

181-
[UTS#18 RL1.4](http://unicode.org/reports/tr18/#Simple_Word_Boundaries)
181+
[UTS#18 RL1.4](https://unicode.org/reports/tr18/#Simple_Word_Boundaries)
182182

183183
The regex crate provides basic Unicode aware word boundary assertions. A word
184184
boundary assertion can be written as `\b`, or `\B` as its negation. A word
@@ -196,9 +196,9 @@ the following classes:
196196
* `\p{gc:Connector_Punctuation}`
197197

198198
In particular, this differs slightly from the
199-
[prescription given in RL1.4](http://unicode.org/reports/tr18/#Simple_Word_Boundaries)
199+
[prescription given in RL1.4](https://unicode.org/reports/tr18/#Simple_Word_Boundaries)
200200
but is permissible according to
201-
[UTS#18 Annex C](http://unicode.org/reports/tr18/#Compatibility_Properties).
201+
[UTS#18 Annex C](https://unicode.org/reports/tr18/#Compatibility_Properties).
202202
Namely, it is convenient and simpler to have `\w` and `\b` be in sync with
203203
one another.
204204

@@ -211,7 +211,7 @@ boundaries is currently sub-optimal on non-ASCII text.
211211

212212
## RL1.5 Simple Loose Matches
213213

214-
[UTS#18 RL1.5](http://unicode.org/reports/tr18/#Simple_Loose_Matches)
214+
[UTS#18 RL1.5](https://unicode.org/reports/tr18/#Simple_Loose_Matches)
215215

216216
The regex crate provides full support for case insensitive matching in
217217
accordance with RL1.5. That is, it uses the "simple" case folding mapping. The
@@ -226,7 +226,7 @@ then all characters classes are case folded as well.
226226

227227
## RL1.6 Line Boundaries
228228

229-
[UTS#18 RL1.6](http://unicode.org/reports/tr18/#Line_Boundaries)
229+
[UTS#18 RL1.6](https://unicode.org/reports/tr18/#Line_Boundaries)
230230

231231
The regex crate only provides support for recognizing the `\n` (`END OF LINE`)
232232
character as a line boundary. This choice was made mostly for implementation
@@ -239,7 +239,7 @@ well, and in theory, this could be done efficiently.
239239

240240
## RL1.7 Code Points
241241

242-
[UTS#18 RL1.7](http://unicode.org/reports/tr18/#Supplementary_Characters)
242+
[UTS#18 RL1.7](https://unicode.org/reports/tr18/#Supplementary_Characters)
243243

244244
The regex crate provides full support for Unicode code point matching. Namely,
245245
the fundamental atom of any match is always a single code point.

regex-syntax/src/lib.rs

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -216,7 +216,7 @@ pub fn is_meta_character(c: char) -> bool {
216216
/// character.
217217
///
218218
/// A Unicode word character is defined by
219-
/// [UTS#18 Annex C](http://unicode.org/reports/tr18/#Compatibility_Properties).
219+
/// [UTS#18 Annex C](https://unicode.org/reports/tr18/#Compatibility_Properties).
220220
/// In particular, a character
221221
/// is considered a word character if it is in either of the `Alphabetic` or
222222
/// `Join_Control` properties, or is in one of the `Decimal_Number`, `Mark`
@@ -236,7 +236,7 @@ pub fn is_word_character(c: char) -> bool {
236236
/// character.
237237
///
238238
/// A Unicode word character is defined by
239-
/// [UTS#18 Annex C](http://unicode.org/reports/tr18/#Compatibility_Properties).
239+
/// [UTS#18 Annex C](https://unicode.org/reports/tr18/#Compatibility_Properties).
240240
/// In particular, a character
241241
/// is considered a word character if it is in either of the `Alphabetic` or
242242
/// `Join_Control` properties, or is in one of the `Decimal_Number`, `Mark`

regex-syntax/src/unicode.rs

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -823,7 +823,7 @@ fn symbolic_name_normalize(x: &str) -> String {
823823
/// The slice returned is guaranteed to be valid UTF-8 for all possible values
824824
/// of `slice`.
825825
///
826-
/// See: http://unicode.org/reports/tr44/#UAX44-LM3
826+
/// See: https://unicode.org/reports/tr44/#UAX44-LM3
827827
fn symbolic_name_normalize_bytes(slice: &mut [u8]) -> &mut [u8] {
828828
// I couldn't find a place in the standard that specified that property
829829
// names/aliases had a particular structure (unlike character names), but

src/lib.rs

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -253,7 +253,7 @@ assert_eq!((mat.start(), mat.end()), (3, 23));
253253
```
254254
255255
For a more detailed breakdown of Unicode support with respect to
256-
[UTS#18](http://unicode.org/reports/tr18/),
256+
[UTS#18](https://unicode.org/reports/tr18/),
257257
please see the
258258
[UNICODE](https://github.com/rust-lang/regex/blob/master/UNICODE.md)
259259
document in the root of the regex repository.
@@ -455,7 +455,7 @@ assert_eq!(&cap[0], "abc");
455455
## Perl character classes (Unicode friendly)
456456
457457
These classes are based on the definitions provided in
458-
[UTS#18](http://www.unicode.org/reports/tr18/#Compatibility_Properties):
458+
[UTS#18](https://www.unicode.org/reports/tr18/#Compatibility_Properties):
459459
460460
<pre class="rust">
461461
\d digit (\p{Nd})

src/sparse.rs

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ use std::slice;
88
/// entire set can also be done in constant time. Iteration yields elements
99
/// in the order in which they were inserted.
1010
///
11-
/// The data structure is based on: http://research.swtch.com/sparse
11+
/// The data structure is based on: https://research.swtch.com/sparse
1212
/// Note though that we don't actually use uninitialized memory. We generally
1313
/// reuse allocations, so the initial allocation cost is bareable. However,
1414
/// its other properties listed above are extremely useful.

0 commit comments

Comments
 (0)