link patterns
The link-patterns extra enables mapping of given regex patterns in text to links. The link_patterns list on the Markdown class provides the mapping.
For example, the following will map "recipe NNNN" and "komodo bug NNNN" to appropriate links:
>>> import markdown2, re
>>> link_patterns = [
... (re.compile(r"recipe\s+(\d+)", re.I), r"http://code.activestate.com/recipes/\1/"),
... (re.compile(r"(?:komodo\s+)?bug\s+(\d+)", re.I), r"http://bugs.activestate.com/show_bug.cgi?id=\1"),
... ]
>>> markdown2.markdown('Recipe 123 and Komodo bug 234 are related.',
... extras=["link-patterns"], link_patterns=link_patterns)
'<p><a href="http://code.activestate.com/recipes/123/">Recipe 123</a> and \
<a href="http://bugs.activestate.com/show_bug.cgi?id=234">Komodo bug 234</a> are related.</p>'
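To see what each (regex, href template) pair produces on its own, the following sketch uses only the standard re module, without markdown2. The helper name preview_links is hypothetical, and Match.expand is used here simply to illustrate how \1 in the template is filled from the captured group; it is not meant as a description of markdown2's internals.

```python
import re

# Same two patterns as in the example above.
link_patterns = [
    (re.compile(r"recipe\s+(\d+)", re.I), r"http://code.activestate.com/recipes/\1/"),
    (re.compile(r"(?:komodo\s+)?bug\s+(\d+)", re.I), r"http://bugs.activestate.com/show_bug.cgi?id=\1"),
]


def preview_links(text):
    """Return (matched_text, href) pairs. Hypothetical helper for illustration only."""
    found = []
    for regex, href_template in link_patterns:
        for match in regex.finditer(text):
            # expand() substitutes \1 with the first captured group.
            found.append((match.group(0), match.expand(href_template)))
    return found


links = preview_links("Recipe 123 and Komodo bug 234 are related.")
for matched_text, href in links:
    print(matched_text, "->", href)
```

Running a pattern list through a preview like this can be a quick way to check the regexes before handing them to markdown2.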
Here is a script that uses link-patterns to auto-link WikiWords (for Wiki syntaxes that do that):
https://github.com/trentm/python-markdown2/blob/master/sandbox/wiki.py
For the command-line interface, the --link-patterns-file option has been added. A "link patterns file" has one link pattern per line of the form:
<regex-pattern> <href-pattern>
For example:
/recipe\s+(\d+)/i http://code.activestate.com/recipes/\1/
/(?:komodo\s+)?bug\s+(\d+)/i http://bugs.activestate.com/show_bug.cgi?id=\1
Spaces are not allowed in the <href-pattern> (to simplify parsing). Lines beginning with a hash (#) are comments.
$ python markdown2.py -x link-patterns --link-patterns-file patterns foo.text
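The file format described above can be read with a few lines of Python. What follows is a minimal sketch under stated assumptions, not markdown2's own parser: the function name parse_link_patterns and the FLAG_MAP table of flag letters are hypothetical, and only the /regex/flags href line shape shown above is handled.

```python
import re

# Assumed mapping of trailing flag letters to re flags (only "i" appears above).
FLAG_MAP = {"i": re.I, "m": re.M, "s": re.S, "x": re.X}


def parse_link_patterns(text):
    """Parse '/regex/flags href' lines into (compiled_regex, href) tuples."""
    patterns = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        # The href contains no spaces, so it is the last whitespace-separated token.
        regex_part, href = line.rsplit(None, 1)
        start = regex_part.index("/")
        end = regex_part.rindex("/")
        if start == end:
            raise ValueError("expected /regex/flags, got: %r" % regex_part)
        flags = 0
        for letter in regex_part[end + 1:]:
            flags |= FLAG_MAP[letter]
        patterns.append((re.compile(regex_part[start + 1:end], flags), href))
    return patterns


sample = r"""
# link patterns for ActiveState resources
/recipe\s+(\d+)/i http://code.activestate.com/recipes/\1/
/(?:komodo\s+)?bug\s+(\d+)/i http://bugs.activestate.com/show_bug.cgi?id=\1
"""
patterns = parse_link_patterns(sample)
print(len(patterns))
```

Splitting from the right is what makes the "no spaces in the href" rule convenient: the regex part may itself contain spaces without confusing the split.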
from markdown2 import Markdown
import re

pattern = re.compile(
    r"""
        \b
        (
            (?:https?://|(?<!//)www\.)    # prefix - https:// or www.
            \w[\w_\-]*(?:\.\w[\w_\-]*)*   # host
            [^<>\s"']*                    # rest of url
            (?<![?!.,:*_~);])             # exclude trailing punctuation
            (?=[?!.,:*_~);]?(?:[<\s]|$))  # make sure that we're not followed by " or ', i.e. we're outside of href="...".
        )
    """,
    re.X
)
markdown = Markdown(
    extras=["link-patterns"],
    link_patterns=[(pattern, r'\1')]
)
markdown.convert('http://www.google.com')
markdown.convert('<a href="http://www.google.com">link</a>')
markdown.convert('[link](http://www.google.com)')
Output
'<p><a href="http://www.google.com">http://www.google.com</a></p>\n'
'<p><a href="http://www.google.com">link</a></p>\n'
'<p><a href="http://www.google.com">link</a></p>\n'
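The lookbehind and lookahead at the end of the URL pattern are easier to follow when the regex is run by itself. The sketch below uses only the re module, outside of markdown2; the sample sentences are made up for illustration. It shows that trailing punctuation is left out of the match and that a URL sitting inside href="..." is not matched, because the character after it is a quote rather than <, whitespace, or the end of the text.

```python
import re

url_pattern = re.compile(
    r"""
    \b
    (
        (?:https?://|(?<!//)www\.)    # scheme, or a bare www. not preceded by //
        \w[\w_\-]*(?:\.\w[\w_\-]*)*   # host
        [^<>\s"']*                    # rest of the URL
        (?<![?!.,:*_~);])             # do not end on punctuation
        (?=[?!.,:*_~);]?(?:[<\s]|$))  # next: optional punctuation, then <, whitespace, or end
    )
    """,
    re.X,
)

# Trailing "," and "." are excluded from the matched URLs.
found = url_pattern.findall("See www.example.com, or http://example.org/path?x=1.")
print(found)  # ['www.example.com', 'http://example.org/path?x=1']

# A URL already inside an href attribute is followed by a quote, so it is skipped.
inside_href = url_pattern.search('<a href="http://www.google.com">link</a>')
print(inside_href)  # None
```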
Without the link-patterns extra, the module only recognizes the standard Markdown link format [text](link). Please note that it only allows the URL schemes http(s) and ftp; otherwise it outputs a bookmark link (#).
(Return to Extras page.)