Gareth Jones
7edbc5a22b
ci: update actions/checkout to v4 ( #233 )
...
* ci: update `actions/checkout` to v4
2025-07-14 21:52:04 +02:00
alheiveea
76e5edb357
limit colspan values to range [1, 1000] ( #232 )
2025-07-09 22:08:47 +02:00
Chris Papademetrious
48724e7002
support backticks in <code> spans ( #226 ) ( #230 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-06-29 14:56:21 -04:00
Chris Papademetrious
9b1412aa5b
implement a strip_pre configuration option ( #218 ) ( #222 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-06-14 16:37:47 -04:00
Chris Papademetrious
75ab3064dd
allow BeautifulSoup configuration kwargs to be specified ( #224 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-06-14 09:06:22 -04:00
Chris Papademetrious
016251e915
ensure that explicitly provided heading conversion functions are used ( #212 ) ( #214 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-05-03 10:57:09 -04:00
Colin
0e1a849346
Add conversion support for <q> tags ( #217 )
2025-04-28 06:37:33 -04:00
Chris Papademetrious
e29de4e753
make convert_hn() public instead of internal ( #213 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-04-20 06:20:01 -04:00
Vincent Kelleher
2d654a6b7e
Add beautiful_soup_parser option ( #206 )
...
* add beautiful_soup_parser option
* add Beautiful Soup parser argument to command line
---------
Co-authored-by: Vincent Kelleher <vincent.kelleher-ext@francetravail.fr >
Co-authored-by: AlexVonB <AlexVonB@users.noreply.github.com >
2025-03-29 11:29:29 +01:00
chrispy
13183f9925
bump to version v1.1.0
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-03-05 06:47:28 -05:00
Stephen V. Brown
7908f1492a
Generalize handling of colspan in case where colspan is in first row but header row is missing ( #203 )
2025-03-04 20:01:16 -05:00
Chris Papademetrious
618747c18c
in inline contexts, resolve <br/> to a space instead of an empty string ( #202 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-03-04 07:37:22 -05:00
Chris Papademetrious
5122c973c1
add missing newlines for definition lists ( #200 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-03-02 06:42:56 -05:00
itmammoth
ac5736f0a3
Support video tag with poster attribute ( #189 )
2025-02-28 10:51:42 +01:00
chrispy
daa9e28287
bump to version v1.0.0
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-02-24 16:18:23 -05:00
Chris Papademetrious
ba5e222b45
use compiled regex for escaping patterns ( #194 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-02-24 12:29:09 -05:00
Chris Papademetrious
6984dca7ab
use a conversion function cache to improve runtime ( #196 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-02-24 11:48:40 -05:00
Chris Papademetrious
24977fd192
rename regex pattern variables ( #195 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-02-19 20:01:12 -05:00
Joseph Myers
c7329ac1ef
Escape right square brackets ( #187 )
2025-02-19 10:04:29 -05:00
Joseph Myers
3311f4d896
Avoid stripping nonbreaking spaces ( #188 )
2025-02-19 07:40:53 -05:00
Chris Papademetrious
5655f27208
propagate parent tag context downward to improve runtime ( #191 )
2025-02-18 16:35:36 -05:00
Chris Papademetrious
c52ba47166
use list-based processing (inspired by AlextheYounga) ( #186 )
2025-02-17 05:47:19 -08:00
Chris Papademetrious
3026602686
make conversion non-destructive to soup; improve div/article/section handling ( #184 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-02-04 18:09:24 -05:00
Chris Papademetrious
c52a50e66a
when computing <ol><li> numbering, ignore non-<li> previous siblings ( #183 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-02-04 15:39:32 -05:00
Chris Papademetrious
d0c4b85fd5
simplify computation of convert_children_as_inline variable ( #182 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-02-04 15:20:42 -05:00
Chris Papademetrious
ae0597d80c
remove superfluous leading/trailing whitespace ( #181 )
2025-01-27 11:55:32 -05:00
Chris Papademetrious
dbb5988802
add blank line before/after preformatted block ( #179 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-01-21 11:01:11 -05:00
Chris Papademetrious
f24ec9e83c
add blank line before ATX-style headings to avoid ambiguity ( #178 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-01-21 11:00:51 -05:00
Chris Papademetrious
7fec8a2080
code simplification to remove need for children_only parameter ( #174 )
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-01-19 10:23:58 -05:00
Fess-AKA-DeadMonk
1b3333073a
for convert_* functions, allow for tags with special characters in their name (like "subtag-name") ( #136 )
...
support custom conversion functions for tags with `:` and `-` characters in their names by mapping them to underscores in the function name
2025-01-19 09:48:08 -05:00
SomeBottle
3bf0b527a4
Add a new configuration option to control tabler header row inference ( #161 )
...
Add option to infer first table row as table header (defaults to false)
2025-01-19 08:13:24 -05:00
Chris Papademetrious
1783995cb2
Merge pull request #173 from chrispy-snps/chrispy/support-definition-lists
...
support HTML definition lists (`<dl>`, `<dt>`, and `<dd>`)
2025-01-18 19:45:03 -05:00
chrispy
0fb855676d
support HTML definition lists (<dl>, <dt>, and <dd>)
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-01-18 19:43:28 -05:00
Chris Papademetrious
f73a435315
Merge pull request #171 from chrispy-snps/chrispy/optimize-li-blockquote-empty-lines
...
optimize empty-line handling for li and blockquote content
2025-01-18 19:30:06 -05:00
chrispy
17c3678d0e
optimize empty-line handling for li and blockquote content
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2025-01-18 19:25:03 -05:00
Chris Papademetrious
600f77d244
allow a wrap_width value of None for unlimited line lengths ( #169 )
...
allow a wrap_width value of None to reflow text to unlimited line length
2025-01-18 19:20:22 -05:00
Chris Papademetrious
9339571ae9
Merge pull request #167 from chrispy-snps/chrispy/table-caption-blank-line
...
insert a blank line between table caption, table content
2025-01-18 19:09:24 -05:00
Chris Papademetrious
5bc3059abf
Merge pull request #165 from chrispy-snps/chrispy/fix-a-in-code
...
do not construct Markdown links in code spans and code blocks
2025-01-18 19:06:51 -05:00
chrispy
1009087d41
insert a blank line between table caption, table content
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2024-12-29 13:52:32 -05:00
chrispy
71e1471e18
do not construct Markdown links in code spans and code blocks
...
Signed-off-by: chrispy <chrispy@synopsys.com >
2024-12-29 12:33:46 -05:00
AlexVonB
6258f5c38b
bump to version v0.14.1
2024-11-24 23:05:02 +01:00
AlexVonB
3466061ca9
prevent <hn> to call convert_hn and crash
...
fixes #142
2024-11-24 21:20:57 +01:00
AlexVonB
9595618796
prevent very large headline prefixes
...
for example: `<h9999999>` could crash the conversion.
fixes #143
2024-11-24 21:11:42 +01:00
AlexVonB
fe8a821a20
bump to version v0.14.0
2024-11-24 12:26:29 +01:00
AlexVonB
54c7ca9937
renamed functions that return boolean
2024-11-24 12:10:57 +01:00
AlexVonB
19780834af
Merge branch 'alfonsrv-fix-pr-118' into jsm28-list-indentation
2024-11-24 12:07:59 +01:00
AlexVonB
9202027e26
ignore bs4 warnings in tests
2024-11-24 12:00:27 +01:00
AlexVonB
9bf4ff14b9
Merge branch 'jsm28-selective-escaping' into jsm28-list-indentation
2024-11-20 14:16:06 +01:00
alfonsrv
7ff4d835ae
Set escape_misc to False by default to improve backwards compatibility
2024-10-09 18:55:50 +02:00
Joseph Myers
c13bdd5c14
Fix logic for indentation inside list items
...
This fixes problems with the markdownify logic for indentation inside
list items.
This PR uses a branch building on that for #120 , #150 and #151 , so
those three PRs should be merged first before merging this one.
There is limited logic in markdownify for handling indentation in the
case of nested lists. There are two major problems with this logic:
* As it's in `convert_list`, causing a list to be indented when inside
another list, it does not add indentation for any other elements
such as paragraphs that may be found inside list items (or `<pre>`,
`<blockquote>`, etc.), so such elements are wrongly not indented and
terminate the list in the output.
* It uses fixed indentation of one tab. Following CommonMark, a tab
in Markdown is considered equivalent to four spaces, which is not
sufficient indentation in ordered list items with a number of three
or more digits.
Fix both of these issues by making `convert_li` handle indentation for
the contents of `<li>`, based on the length of the list item marker,
rather than doing it in `convert_list` at all.
2024-10-03 21:04:40 +00:00