buildframework/helium/external/python/lib/common/docutils-0.5-py2.5.egg/docutils/parsers/rst/states.py
author wbernard
Wed, 23 Dec 2009 19:29:07 +0200
changeset 179 d8ac696cc51f
permissions -rw-r--r--
helium_7.0-r14027
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
179
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
     1
# $Id: states.py 4824 2006-12-09 00:59:23Z goodger $
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
     2
# Author: David Goodger <goodger@python.org>
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
     3
# Copyright: This module has been placed in the public domain.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
     4
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
     5
"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
     6
This is the ``docutils.parsers.restructuredtext.states`` module, the core of
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
     7
the reStructuredText parser.  It defines the following:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
     8
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
     9
:Classes:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    10
    - `RSTStateMachine`: reStructuredText parser's entry point.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    11
    - `NestedStateMachine`: recursive StateMachine.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    12
    - `RSTState`: reStructuredText State superclass.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    13
    - `Inliner`: For parsing inline markup.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    14
    - `Body`: Generic classifier of the first line of a block.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    15
    - `SpecializedBody`: Superclass for compound element members.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    16
    - `BulletList`: Second and subsequent bullet_list list_items
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    17
    - `DefinitionList`: Second+ definition_list_items.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    18
    - `EnumeratedList`: Second+ enumerated_list list_items.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    19
    - `FieldList`: Second+ fields.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    20
    - `OptionList`: Second+ option_list_items.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    21
    - `RFC2822List`: Second+ RFC2822-style fields.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    22
    - `ExtensionOptions`: Parses directive option fields.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    23
    - `Explicit`: Second+ explicit markup constructs.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    24
    - `SubstitutionDef`: For embedded directives in substitution definitions.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    25
    - `Text`: Classifier of second line of a text block.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    26
    - `SpecializedText`: Superclass for continuation lines of Text-variants.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    27
    - `Definition`: Second line of potential definition_list_item.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    28
    - `Line`: Second line of overlined section title or transition marker.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    29
    - `Struct`: An auxiliary collection class.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    30
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    31
:Exception classes:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    32
    - `MarkupError`
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    33
    - `ParserError`
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    34
    - `MarkupMismatch`
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    35
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    36
:Functions:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    37
    - `escape2null()`: Return a string, escape-backslashes converted to nulls.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    38
    - `unescape()`: Return a string, nulls removed or restored to backslashes.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    39
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    40
:Attributes:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    41
    - `state_classes`: set of State classes used with `RSTStateMachine`.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    42
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    43
Parser Overview
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    44
===============
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    45
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    46
The reStructuredText parser is implemented as a recursive state machine,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    47
examining its input one line at a time.  To understand how the parser works,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    48
please first become familiar with the `docutils.statemachine` module.  In the
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    49
description below, references are made to classes defined in this module;
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    50
please see the individual classes for details.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    51
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    52
Parsing proceeds as follows:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    53
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    54
1. The state machine examines each line of input, checking each of the
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    55
   transition patterns of the state `Body`, in order, looking for a match.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    56
   The implicit transitions (blank lines and indentation) are checked before
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    57
   any others.  The 'text' transition is a catch-all (matches anything).
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    58
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    59
2. The method associated with the matched transition pattern is called.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    60
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    61
   A. Some transition methods are self-contained, appending elements to the
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    62
      document tree (`Body.doctest` parses a doctest block).  The parser's
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    63
      current line index is advanced to the end of the element, and parsing
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    64
      continues with step 1.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    65
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    66
   B. Other transition methods trigger the creation of a nested state machine,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    67
      whose job is to parse a compound construct ('indent' does a block quote,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    68
      'bullet' does a bullet list, 'overline' does a section [first checking
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    69
      for a valid section header], etc.).
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    70
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    71
      - In the case of lists and explicit markup, a one-off state machine is
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    72
        created and run to parse contents of the first item.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    73
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    74
      - A new state machine is created and its initial state is set to the
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    75
        appropriate specialized state (`BulletList` in the case of the
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    76
        'bullet' transition; see `SpecializedBody` for more detail).  This
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    77
        state machine is run to parse the compound element (or series of
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    78
        explicit markup elements), and returns as soon as a non-member element
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    79
        is encountered.  For example, the `BulletList` state machine ends as
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    80
        soon as it encounters an element which is not a list item of that
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    81
        bullet list.  The optional omission of inter-element blank lines is
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    82
        enabled by this nested state machine.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    83
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    84
      - The current line index is advanced to the end of the elements parsed,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    85
        and parsing continues with step 1.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    86
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    87
   C. The result of the 'text' transition depends on the next line of text.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    88
      The current state is changed to `Text`, under which the second line is
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    89
      examined.  If the second line is:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    90
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    91
      - Indented: The element is a definition list item, and parsing proceeds
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    92
        similarly to step 2.B, using the `DefinitionList` state.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    93
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    94
      - A line of uniform punctuation characters: The element is a section
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    95
        header; again, parsing proceeds as in step 2.B, and `Body` is still
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    96
        used.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    97
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    98
      - Anything else: The element is a paragraph, which is examined for
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
    99
        inline markup and appended to the parent element.  Processing
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   100
        continues with step 1.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   101
"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   102
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   103
__docformat__ = 'reStructuredText'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   104
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   105
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   106
import sys
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   107
import re
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   108
import roman
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   109
from types import TupleType, FunctionType, MethodType
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   110
from docutils import nodes, statemachine, utils, urischemes
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   111
from docutils import ApplicationError, DataError
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   112
from docutils.statemachine import StateMachineWS, StateWS
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   113
from docutils.nodes import fully_normalize_name as normalize_name
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   114
from docutils.nodes import whitespace_normalize_name
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   115
from docutils.utils import escape2null, unescape, column_width
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   116
import docutils.parsers.rst
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   117
from docutils.parsers.rst import directives, languages, tableparser, roles
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   118
from docutils.parsers.rst.languages import en as _fallback_language_module
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   119
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   120
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   121
class MarkupError(DataError): pass
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   122
class UnknownInterpretedRoleError(DataError): pass
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   123
class InterpretedRoleNotImplementedError(DataError): pass
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   124
class ParserError(ApplicationError): pass
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   125
class MarkupMismatch(Exception): pass
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   126
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   127
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   128
class Struct:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   129
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   130
    """Stores data attributes for dotted-attribute access."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   131
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   132
    def __init__(self, **keywordargs):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   133
        self.__dict__.update(keywordargs)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   134
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   135
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   136
class RSTStateMachine(StateMachineWS):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   137
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   138
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   139
    reStructuredText's master StateMachine.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   140
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   141
    The entry point to reStructuredText parsing is the `run()` method.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   142
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   143
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   144
    def run(self, input_lines, document, input_offset=0, match_titles=1,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   145
            inliner=None):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   146
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   147
        Parse `input_lines` and modify the `document` node in place.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   148
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   149
        Extend `StateMachineWS.run()`: set up parse-global data and
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   150
        run the StateMachine.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   151
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   152
        self.language = languages.get_language(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   153
            document.settings.language_code)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   154
        self.match_titles = match_titles
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   155
        if inliner is None:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   156
            inliner = Inliner()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   157
        inliner.init_customizations(document.settings)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   158
        self.memo = Struct(document=document,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   159
                           reporter=document.reporter,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   160
                           language=self.language,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   161
                           title_styles=[],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   162
                           section_level=0,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   163
                           section_bubble_up_kludge=0,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   164
                           inliner=inliner)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   165
        self.document = document
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   166
        self.attach_observer(document.note_source)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   167
        self.reporter = self.memo.reporter
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   168
        self.node = document
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   169
        results = StateMachineWS.run(self, input_lines, input_offset,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   170
                                     input_source=document['source'])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   171
        assert results == [], 'RSTStateMachine.run() results should be empty!'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   172
        self.node = self.memo = None    # remove unneeded references
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   173
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   174
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   175
class NestedStateMachine(StateMachineWS):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   176
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   177
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   178
    StateMachine run from within other StateMachine runs, to parse nested
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   179
    document structures.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   180
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   181
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   182
    def run(self, input_lines, input_offset, memo, node, match_titles=1):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   183
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   184
        Parse `input_lines` and populate a `docutils.nodes.document` instance.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   185
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   186
        Extend `StateMachineWS.run()`: set up document-wide data.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   187
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   188
        self.match_titles = match_titles
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   189
        self.memo = memo
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   190
        self.document = memo.document
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   191
        self.attach_observer(self.document.note_source)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   192
        self.reporter = memo.reporter
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   193
        self.language = memo.language
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   194
        self.node = node
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   195
        results = StateMachineWS.run(self, input_lines, input_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   196
        assert results == [], ('NestedStateMachine.run() results should be '
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   197
                               'empty!')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   198
        return results
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   199
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   200
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   201
class RSTState(StateWS):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   202
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   203
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   204
    reStructuredText State superclass.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   205
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   206
    Contains methods used by all State subclasses.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   207
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   208
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   209
    nested_sm = NestedStateMachine
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   210
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   211
    def __init__(self, state_machine, debug=0):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   212
        self.nested_sm_kwargs = {'state_classes': state_classes,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   213
                                 'initial_state': 'Body'}
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   214
        StateWS.__init__(self, state_machine, debug)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   215
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   216
    def runtime_init(self):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   217
        StateWS.runtime_init(self)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   218
        memo = self.state_machine.memo
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   219
        self.memo = memo
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   220
        self.reporter = memo.reporter
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   221
        self.inliner = memo.inliner
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   222
        self.document = memo.document
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   223
        self.parent = self.state_machine.node
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   224
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   225
    def goto_line(self, abs_line_offset):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   226
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   227
        Jump to input line `abs_line_offset`, ignoring jumps past the end.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   228
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   229
        try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   230
            self.state_machine.goto_line(abs_line_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   231
        except EOFError:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   232
            pass
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   233
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   234
    def no_match(self, context, transitions):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   235
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   236
        Override `StateWS.no_match` to generate a system message.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   237
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   238
        This code should never be run.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   239
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   240
        self.reporter.severe(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   241
            'Internal error: no transition pattern match.  State: "%s"; '
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   242
            'transitions: %s; context: %s; current line: %r.'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   243
            % (self.__class__.__name__, transitions, context,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   244
               self.state_machine.line),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   245
            line=self.state_machine.abs_line_number())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   246
        return context, None, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   247
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   248
    def bof(self, context):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   249
        """Called at beginning of file."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   250
        return [], []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   251
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   252
    def nested_parse(self, block, input_offset, node, match_titles=0,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   253
                     state_machine_class=None, state_machine_kwargs=None):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   254
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   255
        Create a new StateMachine rooted at `node` and run it over the input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   256
        `block`.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   257
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   258
        if state_machine_class is None:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   259
            state_machine_class = self.nested_sm
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   260
        if state_machine_kwargs is None:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   261
            state_machine_kwargs = self.nested_sm_kwargs
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   262
        block_length = len(block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   263
        state_machine = state_machine_class(debug=self.debug,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   264
                                            **state_machine_kwargs)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   265
        state_machine.run(block, input_offset, memo=self.memo,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   266
                          node=node, match_titles=match_titles)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   267
        state_machine.unlink()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   268
        new_offset = state_machine.abs_line_offset()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   269
        # No `block.parent` implies disconnected -- lines aren't in sync:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   270
        if block.parent and (len(block) - block_length) != 0:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   271
            # Adjustment for block if modified in nested parse:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   272
            self.state_machine.next_line(len(block) - block_length)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   273
        return new_offset
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   274
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   275
    def nested_list_parse(self, block, input_offset, node, initial_state,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   276
                          blank_finish,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   277
                          blank_finish_state=None,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   278
                          extra_settings={},
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   279
                          match_titles=0,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   280
                          state_machine_class=None,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   281
                          state_machine_kwargs=None):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   282
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   283
        Create a new StateMachine rooted at `node` and run it over the input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   284
        `block`. Also keep track of optional intermediate blank lines and the
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   285
        required final one.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   286
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   287
        if state_machine_class is None:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   288
            state_machine_class = self.nested_sm
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   289
        if state_machine_kwargs is None:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   290
            state_machine_kwargs = self.nested_sm_kwargs.copy()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   291
        state_machine_kwargs['initial_state'] = initial_state
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   292
        state_machine = state_machine_class(debug=self.debug,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   293
                                            **state_machine_kwargs)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   294
        if blank_finish_state is None:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   295
            blank_finish_state = initial_state
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   296
        state_machine.states[blank_finish_state].blank_finish = blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   297
        for key, value in extra_settings.items():
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   298
            setattr(state_machine.states[initial_state], key, value)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   299
        state_machine.run(block, input_offset, memo=self.memo,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   300
                          node=node, match_titles=match_titles)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   301
        blank_finish = state_machine.states[blank_finish_state].blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   302
        state_machine.unlink()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   303
        return state_machine.abs_line_offset(), blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   304
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   305
    def section(self, title, source, style, lineno, messages):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   306
        """Check for a valid subsection and create one if it checks out."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   307
        if self.check_subsection(source, style, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   308
            self.new_subsection(title, lineno, messages)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   309
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   310
    def check_subsection(self, source, style, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   311
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   312
        Check for a valid subsection header.  Return 1 (true) or None (false).
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   313
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   314
        When a new section is reached that isn't a subsection of the current
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   315
        section, back up the line count (use ``previous_line(-x)``), then
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   316
        ``raise EOFError``.  The current StateMachine will finish, then the
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   317
        calling StateMachine can re-examine the title.  This will work its way
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   318
        back up the calling chain until the correct section level isreached.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   319
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   320
        @@@ Alternative: Evaluate the title, store the title info & level, and
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   321
        back up the chain until that level is reached.  Store in memo? Or
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   322
        return in results?
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   323
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   324
        :Exception: `EOFError` when a sibling or supersection encountered.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   325
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   326
        memo = self.memo
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   327
        title_styles = memo.title_styles
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   328
        mylevel = memo.section_level
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   329
        try:                            # check for existing title style
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   330
            level = title_styles.index(style) + 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   331
        except ValueError:              # new title style
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   332
            if len(title_styles) == memo.section_level: # new subsection
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   333
                title_styles.append(style)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   334
                return 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   335
            else:                       # not at lowest level
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   336
                self.parent += self.title_inconsistent(source, lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   337
                return None
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   338
        if level <= mylevel:            # sibling or supersection
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   339
            memo.section_level = level   # bubble up to parent section
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   340
            if len(style) == 2:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   341
                memo.section_bubble_up_kludge = 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   342
            # back up 2 lines for underline title, 3 for overline title
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   343
            self.state_machine.previous_line(len(style) + 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   344
            raise EOFError              # let parent section re-evaluate
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   345
        if level == mylevel + 1:        # immediate subsection
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   346
            return 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   347
        else:                           # invalid subsection
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   348
            self.parent += self.title_inconsistent(source, lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   349
            return None
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   350
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   351
    def title_inconsistent(self, sourcetext, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   352
        error = self.reporter.severe(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   353
            'Title level inconsistent:', nodes.literal_block('', sourcetext),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   354
            line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   355
        return error
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   356
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   357
    def new_subsection(self, title, lineno, messages):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   358
        """Append new subsection to document tree. On return, check level."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   359
        memo = self.memo
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   360
        mylevel = memo.section_level
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   361
        memo.section_level += 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   362
        section_node = nodes.section()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   363
        self.parent += section_node
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   364
        textnodes, title_messages = self.inline_text(title, lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   365
        titlenode = nodes.title(title, '', *textnodes)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   366
        name = normalize_name(titlenode.astext())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   367
        section_node['names'].append(name)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   368
        section_node += titlenode
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   369
        section_node += messages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   370
        section_node += title_messages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   371
        self.document.note_implicit_target(section_node, section_node)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   372
        offset = self.state_machine.line_offset + 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   373
        absoffset = self.state_machine.abs_line_offset() + 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   374
        newabsoffset = self.nested_parse(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   375
              self.state_machine.input_lines[offset:], input_offset=absoffset,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   376
              node=section_node, match_titles=1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   377
        self.goto_line(newabsoffset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   378
        if memo.section_level <= mylevel: # can't handle next section?
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   379
            raise EOFError              # bubble up to supersection
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   380
        # reset section_level; next pass will detect it properly
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   381
        memo.section_level = mylevel
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   382
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   383
    def paragraph(self, lines, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   384
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   385
        Return a list (paragraph & messages) & a boolean: literal_block next?
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   386
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   387
        data = '\n'.join(lines).rstrip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   388
        if re.search(r'(?<!\\)(\\\\)*::$', data):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   389
            if len(data) == 2:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   390
                return [], 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   391
            elif data[-3] in ' \n':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   392
                text = data[:-3].rstrip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   393
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   394
                text = data[:-1]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   395
            literalnext = 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   396
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   397
            text = data
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   398
            literalnext = 0
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   399
        textnodes, messages = self.inline_text(text, lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   400
        p = nodes.paragraph(data, '', *textnodes)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   401
        p.line = lineno
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   402
        return [p] + messages, literalnext
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   403
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   404
    def inline_text(self, text, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   405
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   406
        Return 2 lists: nodes (text and inline elements), and system_messages.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   407
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   408
        return self.inliner.parse(text, lineno, self.memo, self.parent)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   409
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   410
    def unindent_warning(self, node_name):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   411
        return self.reporter.warning(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   412
            '%s ends without a blank line; unexpected unindent.' % node_name,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   413
            line=(self.state_machine.abs_line_number() + 1))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   414
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   415
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   416
def build_regexp(definition, compile=1):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   417
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   418
    Build, compile and return a regular expression based on `definition`.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   419
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   420
    :Parameter: `definition`: a 4-tuple (group name, prefix, suffix, parts),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   421
        where "parts" is a list of regular expressions and/or regular
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   422
        expression definitions to be joined into an or-group.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   423
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   424
    name, prefix, suffix, parts = definition
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   425
    part_strings = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   426
    for part in parts:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   427
        if type(part) is TupleType:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   428
            part_strings.append(build_regexp(part, None))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   429
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   430
            part_strings.append(part)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   431
    or_group = '|'.join(part_strings)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   432
    regexp = '%(prefix)s(?P<%(name)s>%(or_group)s)%(suffix)s' % locals()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   433
    if compile:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   434
        return re.compile(regexp, re.UNICODE)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   435
    else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   436
        return regexp
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   437
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   438
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   439
class Inliner:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   440
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   441
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   442
    Parse inline markup; call the `parse()` method.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   443
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   444
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   445
    def __init__(self):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   446
        self.implicit_dispatch = [(self.patterns.uri, self.standalone_uri),]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   447
        """List of (pattern, bound method) tuples, used by
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   448
        `self.implicit_inline`."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   449
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   450
    def init_customizations(self, settings):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   451
        """Setting-based customizations; run when parsing begins."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   452
        if settings.pep_references:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   453
            self.implicit_dispatch.append((self.patterns.pep,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   454
                                           self.pep_reference))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   455
        if settings.rfc_references:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   456
            self.implicit_dispatch.append((self.patterns.rfc,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   457
                                           self.rfc_reference))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   458
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   459
    def parse(self, text, lineno, memo, parent):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   460
        # Needs to be refactored for nested inline markup.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   461
        # Add nested_parse() method?
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   462
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   463
        Return 2 lists: nodes (text and inline elements), and system_messages.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   464
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   465
        Using `self.patterns.initial`, a pattern which matches start-strings
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   466
        (emphasis, strong, interpreted, phrase reference, literal,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   467
        substitution reference, and inline target) and complete constructs
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   468
        (simple reference, footnote reference), search for a candidate.  When
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   469
        one is found, check for validity (e.g., not a quoted '*' character).
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   470
        If valid, search for the corresponding end string if applicable, and
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   471
        check it for validity.  If not found or invalid, generate a warning
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   472
        and ignore the start-string.  Implicit inline markup (e.g. standalone
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   473
        URIs) is found last.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   474
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   475
        self.reporter = memo.reporter
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   476
        self.document = memo.document
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   477
        self.language = memo.language
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   478
        self.parent = parent
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   479
        pattern_search = self.patterns.initial.search
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   480
        dispatch = self.dispatch
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   481
        remaining = escape2null(text)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   482
        processed = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   483
        unprocessed = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   484
        messages = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   485
        while remaining:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   486
            match = pattern_search(remaining)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   487
            if match:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   488
                groups = match.groupdict()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   489
                method = dispatch[groups['start'] or groups['backquote']
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   490
                                  or groups['refend'] or groups['fnend']]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   491
                before, inlines, remaining, sysmessages = method(self, match,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   492
                                                                 lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   493
                unprocessed.append(before)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   494
                messages += sysmessages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   495
                if inlines:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   496
                    processed += self.implicit_inline(''.join(unprocessed),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   497
                                                      lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   498
                    processed += inlines
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   499
                    unprocessed = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   500
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   501
                break
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   502
        remaining = ''.join(unprocessed) + remaining
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   503
        if remaining:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   504
            processed += self.implicit_inline(remaining, lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   505
        return processed, messages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   506
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   507
    openers = '\'"([{<'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   508
    closers = '\'")]}>'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   509
    start_string_prefix = (r'((?<=^)|(?<=[-/: \n%s]))' % re.escape(openers))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   510
    end_string_suffix = (r'((?=$)|(?=[-/:.,;!? \n\x00%s]))'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   511
                         % re.escape(closers))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   512
    non_whitespace_before = r'(?<![ \n])'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   513
    non_whitespace_escape_before = r'(?<![ \n\x00])'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   514
    non_whitespace_after = r'(?![ \n])'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   515
    # Alphanumerics with isolated internal [-._] chars (i.e. not 2 together):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   516
    simplename = r'(?:(?!_)\w)+(?:[-._](?:(?!_)\w)+)*'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   517
    # Valid URI characters (see RFC 2396 & RFC 2732);
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   518
    # final \x00 allows backslash escapes in URIs:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   519
    uric = r"""[-_.!~*'()[\];/:@&=+$,%a-zA-Z0-9\x00]"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   520
    # Delimiter indicating the end of a URI (not part of the URI):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   521
    uri_end_delim = r"""[>]"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   522
    # Last URI character; same as uric but no punctuation:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   523
    urilast = r"""[_~*/=+a-zA-Z0-9]"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   524
    # End of a URI (either 'urilast' or 'uric followed by a
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   525
    # uri_end_delim'):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   526
    uri_end = r"""(?:%(urilast)s|%(uric)s(?=%(uri_end_delim)s))""" % locals()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   527
    emailc = r"""[-_!~*'{|}/#?^`&=+$%a-zA-Z0-9\x00]"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   528
    email_pattern = r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   529
          %(emailc)s+(?:\.%(emailc)s+)*   # name
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   530
          (?<!\x00)@                      # at
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   531
          %(emailc)s+(?:\.%(emailc)s*)*   # host
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   532
          %(uri_end)s                     # final URI char
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   533
          """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   534
    parts = ('initial_inline', start_string_prefix, '',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   535
             [('start', '', non_whitespace_after,  # simple start-strings
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   536
               [r'\*\*',                # strong
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   537
                r'\*(?!\*)',            # emphasis but not strong
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   538
                r'``',                  # literal
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   539
                r'_`',                  # inline internal target
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   540
                r'\|(?!\|)']            # substitution reference
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   541
               ),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   542
              ('whole', '', end_string_suffix, # whole constructs
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   543
               [# reference name & end-string
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   544
                r'(?P<refname>%s)(?P<refend>__?)' % simplename,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   545
                ('footnotelabel', r'\[', r'(?P<fnend>\]_)',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   546
                 [r'[0-9]+',               # manually numbered
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   547
                  r'\#(%s)?' % simplename, # auto-numbered (w/ label?)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   548
                  r'\*',                   # auto-symbol
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   549
                  r'(?P<citationlabel>%s)' % simplename] # citation reference
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   550
                 )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   551
                ]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   552
               ),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   553
              ('backquote',             # interpreted text or phrase reference
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   554
               '(?P<role>(:%s:)?)' % simplename, # optional role
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   555
               non_whitespace_after,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   556
               ['`(?!`)']               # but not literal
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   557
               )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   558
              ]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   559
             )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   560
    patterns = Struct(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   561
          initial=build_regexp(parts),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   562
          emphasis=re.compile(non_whitespace_escape_before
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   563
                              + r'(\*)' + end_string_suffix),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   564
          strong=re.compile(non_whitespace_escape_before
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   565
                            + r'(\*\*)' + end_string_suffix),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   566
          interpreted_or_phrase_ref=re.compile(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   567
              r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   568
              %(non_whitespace_escape_before)s
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   569
              (
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   570
                `
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   571
                (?P<suffix>
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   572
                  (?P<role>:%(simplename)s:)?
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   573
                  (?P<refend>__?)?
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   574
                )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   575
              )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   576
              %(end_string_suffix)s
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   577
              """ % locals(), re.VERBOSE | re.UNICODE),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   578
          embedded_uri=re.compile(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   579
              r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   580
              (
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   581
                (?:[ \n]+|^)            # spaces or beginning of line/string
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   582
                <                       # open bracket
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   583
                %(non_whitespace_after)s
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   584
                ([^<>\x00]+)            # anything but angle brackets & nulls
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   585
                %(non_whitespace_before)s
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   586
                >                       # close bracket w/o whitespace before
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   587
              )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   588
              $                         # end of string
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   589
              """ % locals(), re.VERBOSE),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   590
          literal=re.compile(non_whitespace_before + '(``)'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   591
                             + end_string_suffix),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   592
          target=re.compile(non_whitespace_escape_before
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   593
                            + r'(`)' + end_string_suffix),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   594
          substitution_ref=re.compile(non_whitespace_escape_before
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   595
                                      + r'(\|_{0,2})'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   596
                                      + end_string_suffix),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   597
          email=re.compile(email_pattern % locals() + '$', re.VERBOSE),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   598
          uri=re.compile(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   599
                (r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   600
                %(start_string_prefix)s
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   601
                (?P<whole>
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   602
                  (?P<absolute>           # absolute URI
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   603
                    (?P<scheme>             # scheme (http, ftp, mailto)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   604
                      [a-zA-Z][a-zA-Z0-9.+-]*
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   605
                    )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   606
                    :
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   607
                    (
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   608
                      (                       # either:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   609
                        (//?)?                  # hierarchical URI
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   610
                        %(uric)s*               # URI characters
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   611
                        %(uri_end)s             # final URI char
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   612
                      )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   613
                      (                       # optional query
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   614
                        \?%(uric)s*
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   615
                        %(uri_end)s
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   616
                      )?
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   617
                      (                       # optional fragment
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   618
                        \#%(uric)s*
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   619
                        %(uri_end)s
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   620
                      )?
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   621
                    )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   622
                  )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   623
                |                       # *OR*
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   624
                  (?P<email>              # email address
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   625
                    """ + email_pattern + r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   626
                  )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   627
                )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   628
                %(end_string_suffix)s
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   629
                """) % locals(), re.VERBOSE),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   630
          pep=re.compile(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   631
                r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   632
                %(start_string_prefix)s
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   633
                (
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   634
                  (pep-(?P<pepnum1>\d+)(.txt)?) # reference to source file
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   635
                |
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   636
                  (PEP\s+(?P<pepnum2>\d+))      # reference by name
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   637
                )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   638
                %(end_string_suffix)s""" % locals(), re.VERBOSE),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   639
          rfc=re.compile(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   640
                r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   641
                %(start_string_prefix)s
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   642
                (RFC(-|\s+)?(?P<rfcnum>\d+))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   643
                %(end_string_suffix)s""" % locals(), re.VERBOSE))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   644
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   645
    def quoted_start(self, match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   646
        """Return 1 if inline markup start-string is 'quoted', 0 if not."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   647
        string = match.string
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   648
        start = match.start()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   649
        end = match.end()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   650
        if start == 0:                  # start-string at beginning of text
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   651
            return 0
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   652
        prestart = string[start - 1]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   653
        try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   654
            poststart = string[end]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   655
            if self.openers.index(prestart) \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   656
                  == self.closers.index(poststart):   # quoted
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   657
                return 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   658
        except IndexError:              # start-string at end of text
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   659
            return 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   660
        except ValueError:              # not quoted
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   661
            pass
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   662
        return 0
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   663
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   664
    def inline_obj(self, match, lineno, end_pattern, nodeclass,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   665
                   restore_backslashes=0):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   666
        string = match.string
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   667
        matchstart = match.start('start')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   668
        matchend = match.end('start')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   669
        if self.quoted_start(match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   670
            return (string[:matchend], [], string[matchend:], [], '')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   671
        endmatch = end_pattern.search(string[matchend:])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   672
        if endmatch and endmatch.start(1):  # 1 or more chars
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   673
            text = unescape(endmatch.string[:endmatch.start(1)],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   674
                            restore_backslashes)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   675
            textend = matchend + endmatch.end(1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   676
            rawsource = unescape(string[matchstart:textend], 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   677
            return (string[:matchstart], [nodeclass(rawsource, text)],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   678
                    string[textend:], [], endmatch.group(1))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   679
        msg = self.reporter.warning(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   680
              'Inline %s start-string without end-string.'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   681
              % nodeclass.__name__, line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   682
        text = unescape(string[matchstart:matchend], 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   683
        rawsource = unescape(string[matchstart:matchend], 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   684
        prb = self.problematic(text, rawsource, msg)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   685
        return string[:matchstart], [prb], string[matchend:], [msg], ''
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   686
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   687
    def problematic(self, text, rawsource, message):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   688
        msgid = self.document.set_id(message, self.parent)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   689
        problematic = nodes.problematic(rawsource, text, refid=msgid)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   690
        prbid = self.document.set_id(problematic)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   691
        message.add_backref(prbid)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   692
        return problematic
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   693
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   694
    def emphasis(self, match, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   695
        before, inlines, remaining, sysmessages, endstring = self.inline_obj(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   696
              match, lineno, self.patterns.emphasis, nodes.emphasis)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   697
        return before, inlines, remaining, sysmessages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   698
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   699
    def strong(self, match, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   700
        before, inlines, remaining, sysmessages, endstring = self.inline_obj(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   701
              match, lineno, self.patterns.strong, nodes.strong)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   702
        return before, inlines, remaining, sysmessages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   703
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   704
    def interpreted_or_phrase_ref(self, match, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   705
        end_pattern = self.patterns.interpreted_or_phrase_ref
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   706
        string = match.string
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   707
        matchstart = match.start('backquote')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   708
        matchend = match.end('backquote')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   709
        rolestart = match.start('role')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   710
        role = match.group('role')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   711
        position = ''
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   712
        if role:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   713
            role = role[1:-1]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   714
            position = 'prefix'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   715
        elif self.quoted_start(match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   716
            return (string[:matchend], [], string[matchend:], [])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   717
        endmatch = end_pattern.search(string[matchend:])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   718
        if endmatch and endmatch.start(1):  # 1 or more chars
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   719
            textend = matchend + endmatch.end()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   720
            if endmatch.group('role'):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   721
                if role:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   722
                    msg = self.reporter.warning(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   723
                        'Multiple roles in interpreted text (both '
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   724
                        'prefix and suffix present; only one allowed).',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   725
                        line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   726
                    text = unescape(string[rolestart:textend], 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   727
                    prb = self.problematic(text, text, msg)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   728
                    return string[:rolestart], [prb], string[textend:], [msg]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   729
                role = endmatch.group('suffix')[1:-1]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   730
                position = 'suffix'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   731
            escaped = endmatch.string[:endmatch.start(1)]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   732
            rawsource = unescape(string[matchstart:textend], 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   733
            if rawsource[-1:] == '_':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   734
                if role:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   735
                    msg = self.reporter.warning(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   736
                          'Mismatch: both interpreted text role %s and '
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   737
                          'reference suffix.' % position, line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   738
                    text = unescape(string[rolestart:textend], 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   739
                    prb = self.problematic(text, text, msg)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   740
                    return string[:rolestart], [prb], string[textend:], [msg]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   741
                return self.phrase_ref(string[:matchstart], string[textend:],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   742
                                       rawsource, escaped, unescape(escaped))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   743
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   744
                rawsource = unescape(string[rolestart:textend], 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   745
                nodelist, messages = self.interpreted(rawsource, escaped, role,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   746
                                                      lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   747
                return (string[:rolestart], nodelist,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   748
                        string[textend:], messages)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   749
        msg = self.reporter.warning(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   750
              'Inline interpreted text or phrase reference start-string '
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   751
              'without end-string.', line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   752
        text = unescape(string[matchstart:matchend], 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   753
        prb = self.problematic(text, text, msg)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   754
        return string[:matchstart], [prb], string[matchend:], [msg]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   755
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   756
    def phrase_ref(self, before, after, rawsource, escaped, text):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   757
        match = self.patterns.embedded_uri.search(escaped)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   758
        if match:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   759
            text = unescape(escaped[:match.start(0)])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   760
            uri_text = match.group(2)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   761
            uri = ''.join(uri_text.split())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   762
            uri = self.adjust_uri(uri)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   763
            if uri:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   764
                target = nodes.target(match.group(1), refuri=uri)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   765
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   766
                raise ApplicationError('problem with URI: %r' % uri_text)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   767
            if not text:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   768
                text = uri
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   769
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   770
            target = None
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   771
        refname = normalize_name(text)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   772
        reference = nodes.reference(rawsource, text,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   773
                                    name=whitespace_normalize_name(text))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   774
        node_list = [reference]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   775
        if rawsource[-2:] == '__':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   776
            if target:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   777
                reference['refuri'] = uri
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   778
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   779
                reference['anonymous'] = 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   780
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   781
            if target:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   782
                reference['refuri'] = uri
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   783
                target['names'].append(refname)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   784
                self.document.note_explicit_target(target, self.parent)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   785
                node_list.append(target)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   786
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   787
                reference['refname'] = refname
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   788
                self.document.note_refname(reference)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   789
        return before, node_list, after, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   790
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   791
    def adjust_uri(self, uri):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   792
        match = self.patterns.email.match(uri)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   793
        if match:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   794
            return 'mailto:' + uri
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   795
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   796
            return uri
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   797
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   798
    def interpreted(self, rawsource, text, role, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   799
        role_fn, messages = roles.role(role, self.language, lineno,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   800
                                       self.reporter)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   801
        if role_fn:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   802
            nodes, messages2 = role_fn(role, rawsource, text, lineno, self)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   803
            return nodes, messages + messages2
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   804
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   805
            msg = self.reporter.error(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   806
                'Unknown interpreted text role "%s".' % role,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   807
                line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   808
            return ([self.problematic(rawsource, rawsource, msg)],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   809
                    messages + [msg])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   810
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   811
    def literal(self, match, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   812
        before, inlines, remaining, sysmessages, endstring = self.inline_obj(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   813
              match, lineno, self.patterns.literal, nodes.literal,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   814
              restore_backslashes=1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   815
        return before, inlines, remaining, sysmessages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   816
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   817
    def inline_internal_target(self, match, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   818
        before, inlines, remaining, sysmessages, endstring = self.inline_obj(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   819
              match, lineno, self.patterns.target, nodes.target)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   820
        if inlines and isinstance(inlines[0], nodes.target):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   821
            assert len(inlines) == 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   822
            target = inlines[0]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   823
            name = normalize_name(target.astext())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   824
            target['names'].append(name)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   825
            self.document.note_explicit_target(target, self.parent)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   826
        return before, inlines, remaining, sysmessages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   827
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   828
    def substitution_reference(self, match, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   829
        before, inlines, remaining, sysmessages, endstring = self.inline_obj(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   830
              match, lineno, self.patterns.substitution_ref,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   831
              nodes.substitution_reference)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   832
        if len(inlines) == 1:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   833
            subref_node = inlines[0]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   834
            if isinstance(subref_node, nodes.substitution_reference):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   835
                subref_text = subref_node.astext()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   836
                self.document.note_substitution_ref(subref_node, subref_text)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   837
                if endstring[-1:] == '_':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   838
                    reference_node = nodes.reference(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   839
                        '|%s%s' % (subref_text, endstring), '')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   840
                    if endstring[-2:] == '__':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   841
                        reference_node['anonymous'] = 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   842
                    else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   843
                        reference_node['refname'] = normalize_name(subref_text)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   844
                        self.document.note_refname(reference_node)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   845
                    reference_node += subref_node
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   846
                    inlines = [reference_node]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   847
        return before, inlines, remaining, sysmessages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   848
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   849
    def footnote_reference(self, match, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   850
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   851
        Handles `nodes.footnote_reference` and `nodes.citation_reference`
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   852
        elements.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   853
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   854
        label = match.group('footnotelabel')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   855
        refname = normalize_name(label)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   856
        string = match.string
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   857
        before = string[:match.start('whole')]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   858
        remaining = string[match.end('whole'):]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   859
        if match.group('citationlabel'):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   860
            refnode = nodes.citation_reference('[%s]_' % label,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   861
                                               refname=refname)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   862
            refnode += nodes.Text(label)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   863
            self.document.note_citation_ref(refnode)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   864
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   865
            refnode = nodes.footnote_reference('[%s]_' % label)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   866
            if refname[0] == '#':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   867
                refname = refname[1:]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   868
                refnode['auto'] = 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   869
                self.document.note_autofootnote_ref(refnode)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   870
            elif refname == '*':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   871
                refname = ''
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   872
                refnode['auto'] = '*'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   873
                self.document.note_symbol_footnote_ref(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   874
                      refnode)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   875
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   876
                refnode += nodes.Text(label)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   877
            if refname:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   878
                refnode['refname'] = refname
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   879
                self.document.note_footnote_ref(refnode)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   880
            if utils.get_trim_footnote_ref_space(self.document.settings):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   881
                before = before.rstrip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   882
        return (before, [refnode], remaining, [])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   883
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   884
    def reference(self, match, lineno, anonymous=None):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   885
        referencename = match.group('refname')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   886
        refname = normalize_name(referencename)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   887
        referencenode = nodes.reference(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   888
            referencename + match.group('refend'), referencename,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   889
            name=whitespace_normalize_name(referencename))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   890
        if anonymous:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   891
            referencenode['anonymous'] = 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   892
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   893
            referencenode['refname'] = refname
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   894
            self.document.note_refname(referencenode)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   895
        string = match.string
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   896
        matchstart = match.start('whole')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   897
        matchend = match.end('whole')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   898
        return (string[:matchstart], [referencenode], string[matchend:], [])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   899
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   900
    def anonymous_reference(self, match, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   901
        return self.reference(match, lineno, anonymous=1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   902
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   903
    def standalone_uri(self, match, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   904
        if not match.group('scheme') or urischemes.schemes.has_key(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   905
              match.group('scheme').lower()):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   906
            if match.group('email'):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   907
                addscheme = 'mailto:'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   908
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   909
                addscheme = ''
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   910
            text = match.group('whole')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   911
            unescaped = unescape(text, 0)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   912
            return [nodes.reference(unescape(text, 1), unescaped,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   913
                                    refuri=addscheme + unescaped)]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   914
        else:                   # not a valid scheme
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   915
            raise MarkupMismatch
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   916
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   917
    def pep_reference(self, match, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   918
        text = match.group(0)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   919
        if text.startswith('pep-'):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   920
            pepnum = int(match.group('pepnum1'))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   921
        elif text.startswith('PEP'):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   922
            pepnum = int(match.group('pepnum2'))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   923
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   924
            raise MarkupMismatch
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   925
        ref = (self.document.settings.pep_base_url
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   926
               + self.document.settings.pep_file_url_template % pepnum)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   927
        unescaped = unescape(text, 0)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   928
        return [nodes.reference(unescape(text, 1), unescaped, refuri=ref)]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   929
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   930
    rfc_url = 'rfc%d.html'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   931
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   932
    def rfc_reference(self, match, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   933
        text = match.group(0)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   934
        if text.startswith('RFC'):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   935
            rfcnum = int(match.group('rfcnum'))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   936
            ref = self.document.settings.rfc_base_url + self.rfc_url % rfcnum
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   937
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   938
            raise MarkupMismatch
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   939
        unescaped = unescape(text, 0)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   940
        return [nodes.reference(unescape(text, 1), unescaped, refuri=ref)]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   941
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   942
    def implicit_inline(self, text, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   943
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   944
        Check each of the patterns in `self.implicit_dispatch` for a match,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   945
        and dispatch to the stored method for the pattern.  Recursively check
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   946
        the text before and after the match.  Return a list of `nodes.Text`
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   947
        and inline element nodes.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   948
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   949
        if not text:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   950
            return []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   951
        for pattern, method in self.implicit_dispatch:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   952
            match = pattern.search(text)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   953
            if match:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   954
                try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   955
                    # Must recurse on strings before *and* after the match;
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   956
                    # there may be multiple patterns.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   957
                    return (self.implicit_inline(text[:match.start()], lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   958
                            + method(match, lineno) +
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   959
                            self.implicit_inline(text[match.end():], lineno))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   960
                except MarkupMismatch:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   961
                    pass
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   962
        return [nodes.Text(unescape(text), rawsource=unescape(text, 1))]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   963
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   964
    dispatch = {'*': emphasis,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   965
                '**': strong,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   966
                '`': interpreted_or_phrase_ref,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   967
                '``': literal,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   968
                '_`': inline_internal_target,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   969
                ']_': footnote_reference,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   970
                '|': substitution_reference,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   971
                '_': reference,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   972
                '__': anonymous_reference}
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   973
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   974
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   975
def _loweralpha_to_int(s, _zero=(ord('a')-1)):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   976
    return ord(s) - _zero
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   977
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   978
def _upperalpha_to_int(s, _zero=(ord('A')-1)):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   979
    return ord(s) - _zero
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   980
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   981
def _lowerroman_to_int(s):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   982
    return roman.fromRoman(s.upper())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   983
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   984
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   985
class Body(RSTState):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   986
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   987
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   988
    Generic classifier of the first line of a block.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   989
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   990
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   991
    double_width_pad_char = tableparser.TableParser.double_width_pad_char
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   992
    """Padding character for East Asian double-width text."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   993
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   994
    enum = Struct()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   995
    """Enumerated list parsing information."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   996
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   997
    enum.formatinfo = {
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   998
          'parens': Struct(prefix='(', suffix=')', start=1, end=-1),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
   999
          'rparen': Struct(prefix='', suffix=')', start=0, end=-1),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1000
          'period': Struct(prefix='', suffix='.', start=0, end=-1)}
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1001
    enum.formats = enum.formatinfo.keys()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1002
    enum.sequences = ['arabic', 'loweralpha', 'upperalpha',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1003
                      'lowerroman', 'upperroman'] # ORDERED!
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1004
    enum.sequencepats = {'arabic': '[0-9]+',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1005
                         'loweralpha': '[a-z]',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1006
                         'upperalpha': '[A-Z]',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1007
                         'lowerroman': '[ivxlcdm]+',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1008
                         'upperroman': '[IVXLCDM]+',}
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1009
    enum.converters = {'arabic': int,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1010
                       'loweralpha': _loweralpha_to_int,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1011
                       'upperalpha': _upperalpha_to_int,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1012
                       'lowerroman': _lowerroman_to_int,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1013
                       'upperroman': roman.fromRoman}
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1014
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1015
    enum.sequenceregexps = {}
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1016
    for sequence in enum.sequences:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1017
        enum.sequenceregexps[sequence] = re.compile(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1018
              enum.sequencepats[sequence] + '$')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1019
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1020
    grid_table_top_pat = re.compile(r'\+-[-+]+-\+ *$')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1021
    """Matches the top (& bottom) of a full table)."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1022
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1023
    simple_table_top_pat = re.compile('=+( +=+)+ *$')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1024
    """Matches the top of a simple table."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1025
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1026
    simple_table_border_pat = re.compile('=+[ =]*$')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1027
    """Matches the bottom & header bottom of a simple table."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1028
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1029
    pats = {}
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1030
    """Fragments of patterns used by transitions."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1031
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1032
    pats['nonalphanum7bit'] = '[!-/:-@[-`{-~]'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1033
    pats['alpha'] = '[a-zA-Z]'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1034
    pats['alphanum'] = '[a-zA-Z0-9]'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1035
    pats['alphanumplus'] = '[a-zA-Z0-9_-]'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1036
    pats['enum'] = ('(%(arabic)s|%(loweralpha)s|%(upperalpha)s|%(lowerroman)s'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1037
                    '|%(upperroman)s|#)' % enum.sequencepats)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1038
    pats['optname'] = '%(alphanum)s%(alphanumplus)s*' % pats
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1039
    # @@@ Loosen up the pattern?  Allow Unicode?
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1040
    pats['optarg'] = '(%(alpha)s%(alphanumplus)s*|<[^<>]+>)' % pats
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1041
    pats['shortopt'] = r'(-|\+)%(alphanum)s( ?%(optarg)s)?' % pats
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1042
    pats['longopt'] = r'(--|/)%(optname)s([ =]%(optarg)s)?' % pats
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1043
    pats['option'] = r'(%(shortopt)s|%(longopt)s)' % pats
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1044
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1045
    for format in enum.formats:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1046
        pats[format] = '(?P<%s>%s%s%s)' % (
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1047
              format, re.escape(enum.formatinfo[format].prefix),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1048
              pats['enum'], re.escape(enum.formatinfo[format].suffix))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1049
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1050
    patterns = {
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1051
          'bullet': ur'[-+*\u2022\u2023\u2043]( +|$)',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1052
          'enumerator': r'(%(parens)s|%(rparen)s|%(period)s)( +|$)' % pats,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1053
          'field_marker': r':(?![: ])([^:\\]|\\.)*(?<! ):( +|$)',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1054
          'option_marker': r'%(option)s(, %(option)s)*(  +| ?$)' % pats,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1055
          'doctest': r'>>>( +|$)',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1056
          'line_block': r'\|( +|$)',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1057
          'grid_table_top': grid_table_top_pat,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1058
          'simple_table_top': simple_table_top_pat,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1059
          'explicit_markup': r'\.\.( +|$)',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1060
          'anonymous': r'__( +|$)',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1061
          'line': r'(%(nonalphanum7bit)s)\1* *$' % pats,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1062
          'text': r''}
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1063
    initial_transitions = (
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1064
          'bullet',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1065
          'enumerator',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1066
          'field_marker',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1067
          'option_marker',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1068
          'doctest',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1069
          'line_block',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1070
          'grid_table_top',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1071
          'simple_table_top',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1072
          'explicit_markup',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1073
          'anonymous',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1074
          'line',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1075
          'text')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1076
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1077
    def indent(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1078
        """Block quote."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1079
        indented, indent, line_offset, blank_finish = \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1080
              self.state_machine.get_indented()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1081
        elements = self.block_quote(indented, line_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1082
        self.parent += elements
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1083
        if not blank_finish:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1084
            self.parent += self.unindent_warning('Block quote')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1085
        return context, next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1086
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1087
    def block_quote(self, indented, line_offset):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1088
        elements = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1089
        while indented:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1090
            (blockquote_lines,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1091
             attribution_lines,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1092
             attribution_offset,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1093
             indented,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1094
             new_line_offset) = self.split_attribution(indented, line_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1095
            blockquote = nodes.block_quote()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1096
            self.nested_parse(blockquote_lines, line_offset, blockquote)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1097
            elements.append(blockquote)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1098
            if attribution_lines:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1099
                attribution, messages = self.parse_attribution(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1100
                    attribution_lines, attribution_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1101
                blockquote += attribution
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1102
                elements += messages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1103
            line_offset = new_line_offset
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1104
            while indented and not indented[0]:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1105
                indented = indented[1:]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1106
                line_offset += 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1107
        return elements
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1108
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1109
    # U+2014 is an em-dash:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1110
    attribution_pattern = re.compile(ur'(---?(?!-)|\u2014) *(?=[^ \n])')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1111
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1112
    def split_attribution(self, indented, line_offset):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1113
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1114
        Check for a block quote attribution and split it off:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1115
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1116
        * First line after a blank line must begin with a dash ("--", "---",
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1117
          em-dash; matches `self.attribution_pattern`).
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1118
        * Every line after that must have consistent indentation.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1119
        * Attributions must be preceded by block quote content.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1120
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1121
        Return a tuple of: (block quote content lines, content offset,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1122
        attribution lines, attribution offset, remaining indented lines).
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1123
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1124
        blank = None
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1125
        nonblank_seen = False
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1126
        for i in range(len(indented)):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1127
            line = indented[i].rstrip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1128
            if line:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1129
                if nonblank_seen and blank == i - 1: # last line blank
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1130
                    match = self.attribution_pattern.match(line)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1131
                    if match:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1132
                        attribution_end, indent = self.check_attribution(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1133
                            indented, i)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1134
                        if attribution_end:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1135
                            a_lines = indented[i:attribution_end]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1136
                            a_lines.trim_left(match.end(), end=1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1137
                            a_lines.trim_left(indent, start=1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1138
                            return (indented[:i], a_lines,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1139
                                    i, indented[attribution_end:],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1140
                                    line_offset + attribution_end)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1141
                nonblank_seen = True
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1142
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1143
                blank = i
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1144
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1145
            return (indented, None, None, None, None)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1146
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1147
    def check_attribution(self, indented, attribution_start):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1148
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1149
        Check attribution shape.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1150
        Return the index past the end of the attribution, and the indent.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1151
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1152
        indent = None
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1153
        i = attribution_start + 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1154
        for i in range(attribution_start + 1, len(indented)):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1155
            line = indented[i].rstrip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1156
            if not line:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1157
                break
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1158
            if indent is None:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1159
                indent = len(line) - len(line.lstrip())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1160
            elif len(line) - len(line.lstrip()) != indent:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1161
                return None, None       # bad shape; not an attribution
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1162
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1163
            # return index of line after last attribution line:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1164
            i += 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1165
        return i, (indent or 0)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1166
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1167
    def parse_attribution(self, indented, line_offset):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1168
        text = '\n'.join(indented).rstrip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1169
        lineno = self.state_machine.abs_line_number() + line_offset
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1170
        textnodes, messages = self.inline_text(text, lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1171
        node = nodes.attribution(text, '', *textnodes)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1172
        node.line = lineno
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1173
        return node, messages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1174
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1175
    def bullet(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1176
        """Bullet list item."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1177
        bulletlist = nodes.bullet_list()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1178
        self.parent += bulletlist
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1179
        bulletlist['bullet'] = match.string[0]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1180
        i, blank_finish = self.list_item(match.end())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1181
        bulletlist += i
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1182
        offset = self.state_machine.line_offset + 1   # next line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1183
        new_line_offset, blank_finish = self.nested_list_parse(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1184
              self.state_machine.input_lines[offset:],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1185
              input_offset=self.state_machine.abs_line_offset() + 1,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1186
              node=bulletlist, initial_state='BulletList',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1187
              blank_finish=blank_finish)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1188
        self.goto_line(new_line_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1189
        if not blank_finish:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1190
            self.parent += self.unindent_warning('Bullet list')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1191
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1192
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1193
    def list_item(self, indent):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1194
        if self.state_machine.line[indent:]:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1195
            indented, line_offset, blank_finish = (
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1196
                self.state_machine.get_known_indented(indent))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1197
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1198
            indented, indent, line_offset, blank_finish = (
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1199
                self.state_machine.get_first_known_indented(indent))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1200
        listitem = nodes.list_item('\n'.join(indented))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1201
        if indented:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1202
            self.nested_parse(indented, input_offset=line_offset,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1203
                              node=listitem)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1204
        return listitem, blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1205
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1206
    def enumerator(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1207
        """Enumerated List Item"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1208
        format, sequence, text, ordinal = self.parse_enumerator(match)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1209
        if not self.is_enumerated_list_item(ordinal, sequence, format):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1210
            raise statemachine.TransitionCorrection('text')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1211
        enumlist = nodes.enumerated_list()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1212
        self.parent += enumlist
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1213
        if sequence == '#':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1214
            enumlist['enumtype'] = 'arabic'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1215
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1216
            enumlist['enumtype'] = sequence
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1217
        enumlist['prefix'] = self.enum.formatinfo[format].prefix
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1218
        enumlist['suffix'] = self.enum.formatinfo[format].suffix
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1219
        if ordinal != 1:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1220
            enumlist['start'] = ordinal
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1221
            msg = self.reporter.info(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1222
                'Enumerated list start value not ordinal-1: "%s" (ordinal %s)'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1223
                % (text, ordinal), line=self.state_machine.abs_line_number())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1224
            self.parent += msg
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1225
        listitem, blank_finish = self.list_item(match.end())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1226
        enumlist += listitem
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1227
        offset = self.state_machine.line_offset + 1   # next line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1228
        newline_offset, blank_finish = self.nested_list_parse(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1229
              self.state_machine.input_lines[offset:],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1230
              input_offset=self.state_machine.abs_line_offset() + 1,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1231
              node=enumlist, initial_state='EnumeratedList',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1232
              blank_finish=blank_finish,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1233
              extra_settings={'lastordinal': ordinal,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1234
                              'format': format,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1235
                              'auto': sequence == '#'})
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1236
        self.goto_line(newline_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1237
        if not blank_finish:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1238
            self.parent += self.unindent_warning('Enumerated list')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1239
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1240
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1241
    def parse_enumerator(self, match, expected_sequence=None):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1242
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1243
        Analyze an enumerator and return the results.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1244
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1245
        :Return:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1246
            - the enumerator format ('period', 'parens', or 'rparen'),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1247
            - the sequence used ('arabic', 'loweralpha', 'upperroman', etc.),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1248
            - the text of the enumerator, stripped of formatting, and
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1249
            - the ordinal value of the enumerator ('a' -> 1, 'ii' -> 2, etc.;
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1250
              ``None`` is returned for invalid enumerator text).
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1251
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1252
        The enumerator format has already been determined by the regular
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1253
        expression match. If `expected_sequence` is given, that sequence is
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1254
        tried first. If not, we check for Roman numeral 1. This way,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1255
        single-character Roman numerals (which are also alphabetical) can be
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1256
        matched. If no sequence has been matched, all sequences are checked in
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1257
        order.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1258
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1259
        groupdict = match.groupdict()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1260
        sequence = ''
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1261
        for format in self.enum.formats:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1262
            if groupdict[format]:       # was this the format matched?
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1263
                break                   # yes; keep `format`
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1264
        else:                           # shouldn't happen
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1265
            raise ParserError('enumerator format not matched')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1266
        text = groupdict[format][self.enum.formatinfo[format].start
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1267
                                 :self.enum.formatinfo[format].end]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1268
        if text == '#':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1269
            sequence = '#'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1270
        elif expected_sequence:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1271
            try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1272
                if self.enum.sequenceregexps[expected_sequence].match(text):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1273
                    sequence = expected_sequence
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1274
            except KeyError:            # shouldn't happen
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1275
                raise ParserError('unknown enumerator sequence: %s'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1276
                                  % sequence)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1277
        elif text == 'i':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1278
            sequence = 'lowerroman'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1279
        elif text == 'I':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1280
            sequence = 'upperroman'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1281
        if not sequence:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1282
            for sequence in self.enum.sequences:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1283
                if self.enum.sequenceregexps[sequence].match(text):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1284
                    break
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1285
            else:                       # shouldn't happen
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1286
                raise ParserError('enumerator sequence not matched')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1287
        if sequence == '#':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1288
            ordinal = 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1289
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1290
            try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1291
                ordinal = self.enum.converters[sequence](text)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1292
            except roman.InvalidRomanNumeralError:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1293
                ordinal = None
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1294
        return format, sequence, text, ordinal
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1295
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1296
    def is_enumerated_list_item(self, ordinal, sequence, format):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1297
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1298
        Check validity based on the ordinal value and the second line.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1299
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1300
        Return true iff the ordinal is valid and the second line is blank,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1301
        indented, or starts with the next enumerator or an auto-enumerator.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1302
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1303
        if ordinal is None:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1304
            return None
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1305
        try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1306
            next_line = self.state_machine.next_line()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1307
        except EOFError:              # end of input lines
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1308
            self.state_machine.previous_line()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1309
            return 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1310
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1311
            self.state_machine.previous_line()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1312
        if not next_line[:1].strip():   # blank or indented
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1313
            return 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1314
        result = self.make_enumerator(ordinal + 1, sequence, format)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1315
        if result:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1316
            next_enumerator, auto_enumerator = result
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1317
            try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1318
                if ( next_line.startswith(next_enumerator) or
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1319
                     next_line.startswith(auto_enumerator) ):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1320
                    return 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1321
            except TypeError:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1322
                pass
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1323
        return None
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1324
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1325
    def make_enumerator(self, ordinal, sequence, format):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1326
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1327
        Construct and return the next enumerated list item marker, and an
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1328
        auto-enumerator ("#" instead of the regular enumerator).
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1329
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1330
        Return ``None`` for invalid (out of range) ordinals.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1331
        """ #"
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1332
        if sequence == '#':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1333
            enumerator = '#'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1334
        elif sequence == 'arabic':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1335
            enumerator = str(ordinal)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1336
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1337
            if sequence.endswith('alpha'):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1338
                if ordinal > 26:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1339
                    return None
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1340
                enumerator = chr(ordinal + ord('a') - 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1341
            elif sequence.endswith('roman'):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1342
                try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1343
                    enumerator = roman.toRoman(ordinal)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1344
                except roman.RomanError:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1345
                    return None
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1346
            else:                       # shouldn't happen
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1347
                raise ParserError('unknown enumerator sequence: "%s"'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1348
                                  % sequence)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1349
            if sequence.startswith('lower'):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1350
                enumerator = enumerator.lower()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1351
            elif sequence.startswith('upper'):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1352
                enumerator = enumerator.upper()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1353
            else:                       # shouldn't happen
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1354
                raise ParserError('unknown enumerator sequence: "%s"'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1355
                                  % sequence)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1356
        formatinfo = self.enum.formatinfo[format]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1357
        next_enumerator = (formatinfo.prefix + enumerator + formatinfo.suffix
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1358
                           + ' ')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1359
        auto_enumerator = formatinfo.prefix + '#' + formatinfo.suffix + ' '
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1360
        return next_enumerator, auto_enumerator
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1361
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1362
    def field_marker(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1363
        """Field list item."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1364
        field_list = nodes.field_list()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1365
        self.parent += field_list
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1366
        field, blank_finish = self.field(match)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1367
        field_list += field
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1368
        offset = self.state_machine.line_offset + 1   # next line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1369
        newline_offset, blank_finish = self.nested_list_parse(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1370
              self.state_machine.input_lines[offset:],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1371
              input_offset=self.state_machine.abs_line_offset() + 1,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1372
              node=field_list, initial_state='FieldList',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1373
              blank_finish=blank_finish)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1374
        self.goto_line(newline_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1375
        if not blank_finish:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1376
            self.parent += self.unindent_warning('Field list')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1377
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1378
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1379
    def field(self, match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1380
        name = self.parse_field_marker(match)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1381
        lineno = self.state_machine.abs_line_number()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1382
        indented, indent, line_offset, blank_finish = \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1383
              self.state_machine.get_first_known_indented(match.end())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1384
        field_node = nodes.field()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1385
        field_node.line = lineno
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1386
        name_nodes, name_messages = self.inline_text(name, lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1387
        field_node += nodes.field_name(name, '', *name_nodes)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1388
        field_body = nodes.field_body('\n'.join(indented), *name_messages)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1389
        field_node += field_body
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1390
        if indented:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1391
            self.parse_field_body(indented, line_offset, field_body)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1392
        return field_node, blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1393
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1394
    def parse_field_marker(self, match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1395
        """Extract & return field name from a field marker match."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1396
        field = match.group()[1:]        # strip off leading ':'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1397
        field = field[:field.rfind(':')] # strip off trailing ':' etc.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1398
        return field
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1399
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1400
    def parse_field_body(self, indented, offset, node):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1401
        self.nested_parse(indented, input_offset=offset, node=node)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1402
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1403
    def option_marker(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1404
        """Option list item."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1405
        optionlist = nodes.option_list()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1406
        try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1407
            listitem, blank_finish = self.option_list_item(match)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1408
        except MarkupError, (message, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1409
            # This shouldn't happen; pattern won't match.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1410
            msg = self.reporter.error(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1411
                'Invalid option list marker: %s' % message, line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1412
            self.parent += msg
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1413
            indented, indent, line_offset, blank_finish = \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1414
                  self.state_machine.get_first_known_indented(match.end())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1415
            elements = self.block_quote(indented, line_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1416
            self.parent += elements
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1417
            if not blank_finish:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1418
                self.parent += self.unindent_warning('Option list')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1419
            return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1420
        self.parent += optionlist
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1421
        optionlist += listitem
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1422
        offset = self.state_machine.line_offset + 1   # next line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1423
        newline_offset, blank_finish = self.nested_list_parse(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1424
              self.state_machine.input_lines[offset:],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1425
              input_offset=self.state_machine.abs_line_offset() + 1,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1426
              node=optionlist, initial_state='OptionList',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1427
              blank_finish=blank_finish)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1428
        self.goto_line(newline_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1429
        if not blank_finish:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1430
            self.parent += self.unindent_warning('Option list')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1431
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1432
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1433
    def option_list_item(self, match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1434
        offset = self.state_machine.abs_line_offset()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1435
        options = self.parse_option_marker(match)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1436
        indented, indent, line_offset, blank_finish = \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1437
              self.state_machine.get_first_known_indented(match.end())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1438
        if not indented:                # not an option list item
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1439
            self.goto_line(offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1440
            raise statemachine.TransitionCorrection('text')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1441
        option_group = nodes.option_group('', *options)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1442
        description = nodes.description('\n'.join(indented))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1443
        option_list_item = nodes.option_list_item('', option_group,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1444
                                                  description)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1445
        if indented:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1446
            self.nested_parse(indented, input_offset=line_offset,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1447
                              node=description)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1448
        return option_list_item, blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1449
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1450
    def parse_option_marker(self, match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1451
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1452
        Return a list of `node.option` and `node.option_argument` objects,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1453
        parsed from an option marker match.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1454
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1455
        :Exception: `MarkupError` for invalid option markers.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1456
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1457
        optlist = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1458
        optionstrings = match.group().rstrip().split(', ')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1459
        for optionstring in optionstrings:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1460
            tokens = optionstring.split()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1461
            delimiter = ' '
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1462
            firstopt = tokens[0].split('=')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1463
            if len(firstopt) > 1:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1464
                # "--opt=value" form
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1465
                tokens[:1] = firstopt
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1466
                delimiter = '='
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1467
            elif (len(tokens[0]) > 2
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1468
                  and ((tokens[0].startswith('-')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1469
                        and not tokens[0].startswith('--'))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1470
                       or tokens[0].startswith('+'))):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1471
                # "-ovalue" form
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1472
                tokens[:1] = [tokens[0][:2], tokens[0][2:]]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1473
                delimiter = ''
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1474
            if len(tokens) > 1 and (tokens[1].startswith('<')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1475
                                    and tokens[-1].endswith('>')):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1476
                # "-o <value1 value2>" form; join all values into one token
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1477
                tokens[1:] = [' '.join(tokens[1:])]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1478
            if 0 < len(tokens) <= 2:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1479
                option = nodes.option(optionstring)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1480
                option += nodes.option_string(tokens[0], tokens[0])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1481
                if len(tokens) > 1:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1482
                    option += nodes.option_argument(tokens[1], tokens[1],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1483
                                                    delimiter=delimiter)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1484
                optlist.append(option)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1485
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1486
                raise MarkupError(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1487
                    'wrong number of option tokens (=%s), should be 1 or 2: '
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1488
                    '"%s"' % (len(tokens), optionstring),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1489
                    self.state_machine.abs_line_number() + 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1490
        return optlist
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1491
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1492
    def doctest(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1493
        data = '\n'.join(self.state_machine.get_text_block())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1494
        self.parent += nodes.doctest_block(data, data)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1495
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1496
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1497
    def line_block(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1498
        """First line of a line block."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1499
        block = nodes.line_block()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1500
        self.parent += block
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1501
        lineno = self.state_machine.abs_line_number()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1502
        line, messages, blank_finish = self.line_block_line(match, lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1503
        block += line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1504
        self.parent += messages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1505
        if not blank_finish:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1506
            offset = self.state_machine.line_offset + 1   # next line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1507
            new_line_offset, blank_finish = self.nested_list_parse(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1508
                  self.state_machine.input_lines[offset:],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1509
                  input_offset=self.state_machine.abs_line_offset() + 1,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1510
                  node=block, initial_state='LineBlock',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1511
                  blank_finish=0)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1512
            self.goto_line(new_line_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1513
        if not blank_finish:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1514
            self.parent += self.reporter.warning(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1515
                'Line block ends without a blank line.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1516
                line=(self.state_machine.abs_line_number() + 1))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1517
        if len(block):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1518
            if block[0].indent is None:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1519
                block[0].indent = 0
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1520
            self.nest_line_block_lines(block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1521
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1522
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1523
    def line_block_line(self, match, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1524
        """Return one line element of a line_block."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1525
        indented, indent, line_offset, blank_finish = \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1526
              self.state_machine.get_first_known_indented(match.end(),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1527
                                                          until_blank=1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1528
        text = u'\n'.join(indented)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1529
        text_nodes, messages = self.inline_text(text, lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1530
        line = nodes.line(text, '', *text_nodes)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1531
        if match.string.rstrip() != '|': # not empty
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1532
            line.indent = len(match.group(1)) - 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1533
        return line, messages, blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1534
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1535
    def nest_line_block_lines(self, block):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1536
        for index in range(1, len(block)):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1537
            if block[index].indent is None:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1538
                block[index].indent = block[index - 1].indent
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1539
        self.nest_line_block_segment(block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1540
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1541
    def nest_line_block_segment(self, block):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1542
        indents = [item.indent for item in block]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1543
        least = min(indents)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1544
        new_items = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1545
        new_block = nodes.line_block()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1546
        for item in block:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1547
            if item.indent > least:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1548
                new_block.append(item)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1549
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1550
                if len(new_block):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1551
                    self.nest_line_block_segment(new_block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1552
                    new_items.append(new_block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1553
                    new_block = nodes.line_block()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1554
                new_items.append(item)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1555
        if len(new_block):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1556
            self.nest_line_block_segment(new_block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1557
            new_items.append(new_block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1558
        block[:] = new_items
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1559
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1560
    def grid_table_top(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1561
        """Top border of a full table."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1562
        return self.table_top(match, context, next_state,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1563
                              self.isolate_grid_table,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1564
                              tableparser.GridTableParser)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1565
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1566
    def simple_table_top(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1567
        """Top border of a simple table."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1568
        return self.table_top(match, context, next_state,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1569
                              self.isolate_simple_table,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1570
                              tableparser.SimpleTableParser)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1571
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1572
    def table_top(self, match, context, next_state,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1573
                  isolate_function, parser_class):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1574
        """Top border of a generic table."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1575
        nodelist, blank_finish = self.table(isolate_function, parser_class)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1576
        self.parent += nodelist
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1577
        if not blank_finish:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1578
            msg = self.reporter.warning(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1579
                'Blank line required after table.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1580
                line=self.state_machine.abs_line_number() + 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1581
            self.parent += msg
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1582
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1583
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1584
    def table(self, isolate_function, parser_class):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1585
        """Parse a table."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1586
        block, messages, blank_finish = isolate_function()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1587
        if block:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1588
            try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1589
                parser = parser_class()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1590
                tabledata = parser.parse(block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1591
                tableline = (self.state_machine.abs_line_number() - len(block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1592
                             + 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1593
                table = self.build_table(tabledata, tableline)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1594
                nodelist = [table] + messages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1595
            except tableparser.TableMarkupError, detail:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1596
                nodelist = self.malformed_table(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1597
                    block, ' '.join(detail.args)) + messages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1598
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1599
            nodelist = messages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1600
        return nodelist, blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1601
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1602
    def isolate_grid_table(self):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1603
        messages = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1604
        blank_finish = 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1605
        try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1606
            block = self.state_machine.get_text_block(flush_left=1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1607
        except statemachine.UnexpectedIndentationError, instance:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1608
            block, source, lineno = instance.args
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1609
            messages.append(self.reporter.error('Unexpected indentation.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1610
                                                source=source, line=lineno))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1611
            blank_finish = 0
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1612
        block.disconnect()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1613
        # for East Asian chars:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1614
        block.pad_double_width(self.double_width_pad_char)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1615
        width = len(block[0].strip())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1616
        for i in range(len(block)):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1617
            block[i] = block[i].strip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1618
            if block[i][0] not in '+|': # check left edge
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1619
                blank_finish = 0
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1620
                self.state_machine.previous_line(len(block) - i)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1621
                del block[i:]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1622
                break
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1623
        if not self.grid_table_top_pat.match(block[-1]): # find bottom
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1624
            blank_finish = 0
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1625
            # from second-last to third line of table:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1626
            for i in range(len(block) - 2, 1, -1):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1627
                if self.grid_table_top_pat.match(block[i]):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1628
                    self.state_machine.previous_line(len(block) - i + 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1629
                    del block[i+1:]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1630
                    break
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1631
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1632
                messages.extend(self.malformed_table(block))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1633
                return [], messages, blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1634
        for i in range(len(block)):     # check right edge
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1635
            if len(block[i]) != width or block[i][-1] not in '+|':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1636
                messages.extend(self.malformed_table(block))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1637
                return [], messages, blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1638
        return block, messages, blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1639
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1640
    def isolate_simple_table(self):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1641
        start = self.state_machine.line_offset
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1642
        lines = self.state_machine.input_lines
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1643
        limit = len(lines) - 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1644
        toplen = len(lines[start].strip())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1645
        pattern_match = self.simple_table_border_pat.match
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1646
        found = 0
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1647
        found_at = None
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1648
        i = start + 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1649
        while i <= limit:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1650
            line = lines[i]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1651
            match = pattern_match(line)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1652
            if match:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1653
                if len(line.strip()) != toplen:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1654
                    self.state_machine.next_line(i - start)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1655
                    messages = self.malformed_table(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1656
                        lines[start:i+1], 'Bottom/header table border does '
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1657
                        'not match top border.')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1658
                    return [], messages, i == limit or not lines[i+1].strip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1659
                found += 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1660
                found_at = i
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1661
                if found == 2 or i == limit or not lines[i+1].strip():
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1662
                    end = i
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1663
                    break
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1664
            i += 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1665
        else:                           # reached end of input_lines
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1666
            if found:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1667
                extra = ' or no blank line after table bottom'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1668
                self.state_machine.next_line(found_at - start)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1669
                block = lines[start:found_at+1]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1670
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1671
                extra = ''
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1672
                self.state_machine.next_line(i - start - 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1673
                block = lines[start:]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1674
            messages = self.malformed_table(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1675
                block, 'No bottom table border found%s.' % extra)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1676
            return [], messages, not extra
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1677
        self.state_machine.next_line(end - start)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1678
        block = lines[start:end+1]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1679
        # for East Asian chars:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1680
        block.pad_double_width(self.double_width_pad_char)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1681
        return block, [], end == limit or not lines[end+1].strip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1682
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1683
    def malformed_table(self, block, detail=''):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1684
        block.replace(self.double_width_pad_char, '')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1685
        data = '\n'.join(block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1686
        message = 'Malformed table.'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1687
        lineno = self.state_machine.abs_line_number() - len(block) + 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1688
        if detail:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1689
            message += '\n' + detail
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1690
        error = self.reporter.error(message, nodes.literal_block(data, data),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1691
                                    line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1692
        return [error]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1693
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1694
    def build_table(self, tabledata, tableline, stub_columns=0):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1695
        colwidths, headrows, bodyrows = tabledata
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1696
        table = nodes.table()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1697
        tgroup = nodes.tgroup(cols=len(colwidths))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1698
        table += tgroup
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1699
        for colwidth in colwidths:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1700
            colspec = nodes.colspec(colwidth=colwidth)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1701
            if stub_columns:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1702
                colspec.attributes['stub'] = 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1703
                stub_columns -= 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1704
            tgroup += colspec
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1705
        if headrows:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1706
            thead = nodes.thead()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1707
            tgroup += thead
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1708
            for row in headrows:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1709
                thead += self.build_table_row(row, tableline)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1710
        tbody = nodes.tbody()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1711
        tgroup += tbody
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1712
        for row in bodyrows:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1713
            tbody += self.build_table_row(row, tableline)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1714
        return table
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1715
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1716
    def build_table_row(self, rowdata, tableline):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1717
        row = nodes.row()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1718
        for cell in rowdata:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1719
            if cell is None:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1720
                continue
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1721
            morerows, morecols, offset, cellblock = cell
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1722
            attributes = {}
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1723
            if morerows:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1724
                attributes['morerows'] = morerows
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1725
            if morecols:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1726
                attributes['morecols'] = morecols
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1727
            entry = nodes.entry(**attributes)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1728
            row += entry
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1729
            if ''.join(cellblock):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1730
                self.nested_parse(cellblock, input_offset=tableline+offset,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1731
                                  node=entry)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1732
        return row
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1733
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1734
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1735
    explicit = Struct()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1736
    """Patterns and constants used for explicit markup recognition."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1737
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1738
    explicit.patterns = Struct(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1739
          target=re.compile(r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1740
                            (
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1741
                              _               # anonymous target
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1742
                            |               # *OR*
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1743
                              (?!_)           # no underscore at the beginning
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1744
                              (?P<quote>`?)   # optional open quote
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1745
                              (?![ `])        # first char. not space or
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1746
                                              # backquote
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1747
                              (?P<name>       # reference name
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1748
                                .+?
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1749
                              )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1750
                              %(non_whitespace_escape_before)s
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1751
                              (?P=quote)      # close quote if open quote used
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1752
                            )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1753
                            (?<!(?<!\x00):) # no unescaped colon at end
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1754
                            %(non_whitespace_escape_before)s
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1755
                            [ ]?            # optional space
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1756
                            :               # end of reference name
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1757
                            ([ ]+|$)        # followed by whitespace
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1758
                            """ % vars(Inliner), re.VERBOSE),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1759
          reference=re.compile(r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1760
                               (
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1761
                                 (?P<simple>%(simplename)s)_
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1762
                               |                  # *OR*
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1763
                                 `                  # open backquote
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1764
                                 (?![ ])            # not space
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1765
                                 (?P<phrase>.+?)    # hyperlink phrase
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1766
                                 %(non_whitespace_escape_before)s
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1767
                                 `_                 # close backquote,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1768
                                                    # reference mark
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1769
                               )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1770
                               $                  # end of string
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1771
                               """ % vars(Inliner), re.VERBOSE | re.UNICODE),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1772
          substitution=re.compile(r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1773
                                  (
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1774
                                    (?![ ])          # first char. not space
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1775
                                    (?P<name>.+?)    # substitution text
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1776
                                    %(non_whitespace_escape_before)s
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1777
                                    \|               # close delimiter
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1778
                                  )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1779
                                  ([ ]+|$)           # followed by whitespace
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1780
                                  """ % vars(Inliner), re.VERBOSE),)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1781
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1782
    def footnote(self, match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1783
        lineno = self.state_machine.abs_line_number()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1784
        indented, indent, offset, blank_finish = \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1785
              self.state_machine.get_first_known_indented(match.end())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1786
        label = match.group(1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1787
        name = normalize_name(label)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1788
        footnote = nodes.footnote('\n'.join(indented))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1789
        footnote.line = lineno
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1790
        if name[0] == '#':              # auto-numbered
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1791
            name = name[1:]             # autonumber label
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1792
            footnote['auto'] = 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1793
            if name:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1794
                footnote['names'].append(name)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1795
            self.document.note_autofootnote(footnote)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1796
        elif name == '*':               # auto-symbol
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1797
            name = ''
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1798
            footnote['auto'] = '*'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1799
            self.document.note_symbol_footnote(footnote)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1800
        else:                           # manually numbered
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1801
            footnote += nodes.label('', label)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1802
            footnote['names'].append(name)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1803
            self.document.note_footnote(footnote)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1804
        if name:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1805
            self.document.note_explicit_target(footnote, footnote)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1806
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1807
            self.document.set_id(footnote, footnote)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1808
        if indented:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1809
            self.nested_parse(indented, input_offset=offset, node=footnote)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1810
        return [footnote], blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1811
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1812
    def citation(self, match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1813
        lineno = self.state_machine.abs_line_number()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1814
        indented, indent, offset, blank_finish = \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1815
              self.state_machine.get_first_known_indented(match.end())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1816
        label = match.group(1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1817
        name = normalize_name(label)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1818
        citation = nodes.citation('\n'.join(indented))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1819
        citation.line = lineno
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1820
        citation += nodes.label('', label)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1821
        citation['names'].append(name)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1822
        self.document.note_citation(citation)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1823
        self.document.note_explicit_target(citation, citation)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1824
        if indented:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1825
            self.nested_parse(indented, input_offset=offset, node=citation)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1826
        return [citation], blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1827
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1828
    def hyperlink_target(self, match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1829
        pattern = self.explicit.patterns.target
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1830
        lineno = self.state_machine.abs_line_number()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1831
        block, indent, offset, blank_finish = \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1832
              self.state_machine.get_first_known_indented(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1833
              match.end(), until_blank=1, strip_indent=0)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1834
        blocktext = match.string[:match.end()] + '\n'.join(block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1835
        block = [escape2null(line) for line in block]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1836
        escaped = block[0]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1837
        blockindex = 0
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1838
        while 1:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1839
            targetmatch = pattern.match(escaped)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1840
            if targetmatch:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1841
                break
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1842
            blockindex += 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1843
            try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1844
                escaped += block[blockindex]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1845
            except IndexError:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1846
                raise MarkupError('malformed hyperlink target.', lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1847
        del block[:blockindex]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1848
        block[0] = (block[0] + ' ')[targetmatch.end()-len(escaped)-1:].strip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1849
        target = self.make_target(block, blocktext, lineno,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1850
                                  targetmatch.group('name'))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1851
        return [target], blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1852
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1853
    def make_target(self, block, block_text, lineno, target_name):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1854
        target_type, data = self.parse_target(block, block_text, lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1855
        if target_type == 'refname':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1856
            target = nodes.target(block_text, '', refname=normalize_name(data))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1857
            target.indirect_reference_name = data
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1858
            self.add_target(target_name, '', target, lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1859
            self.document.note_indirect_target(target)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1860
            return target
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1861
        elif target_type == 'refuri':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1862
            target = nodes.target(block_text, '')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1863
            self.add_target(target_name, data, target, lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1864
            return target
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1865
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1866
            return data
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1867
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1868
    def parse_target(self, block, block_text, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1869
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1870
        Determine the type of reference of a target.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1871
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1872
        :Return: A 2-tuple, one of:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1873
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1874
            - 'refname' and the indirect reference name
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1875
            - 'refuri' and the URI
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1876
            - 'malformed' and a system_message node
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1877
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1878
        if block and block[-1].strip()[-1:] == '_': # possible indirect target
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1879
            reference = ' '.join([line.strip() for line in block])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1880
            refname = self.is_reference(reference)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1881
            if refname:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1882
                return 'refname', refname
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1883
        reference = ''.join([''.join(line.split()) for line in block])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1884
        return 'refuri', unescape(reference)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1885
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1886
    def is_reference(self, reference):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1887
        match = self.explicit.patterns.reference.match(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1888
            whitespace_normalize_name(reference))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1889
        if not match:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1890
            return None
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1891
        return unescape(match.group('simple') or match.group('phrase'))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1892
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1893
    def add_target(self, targetname, refuri, target, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1894
        target.line = lineno
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1895
        if targetname:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1896
            name = normalize_name(unescape(targetname))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1897
            target['names'].append(name)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1898
            if refuri:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1899
                uri = self.inliner.adjust_uri(refuri)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1900
                if uri:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1901
                    target['refuri'] = uri
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1902
                else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1903
                    raise ApplicationError('problem with URI: %r' % refuri)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1904
            self.document.note_explicit_target(target, self.parent)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1905
        else:                       # anonymous target
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1906
            if refuri:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1907
                target['refuri'] = refuri
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1908
            target['anonymous'] = 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1909
            self.document.note_anonymous_target(target)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1910
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1911
    def substitution_def(self, match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1912
        pattern = self.explicit.patterns.substitution
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1913
        lineno = self.state_machine.abs_line_number()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1914
        block, indent, offset, blank_finish = \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1915
              self.state_machine.get_first_known_indented(match.end(),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1916
                                                          strip_indent=0)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1917
        blocktext = (match.string[:match.end()] + '\n'.join(block))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1918
        block.disconnect()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1919
        escaped = escape2null(block[0].rstrip())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1920
        blockindex = 0
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1921
        while 1:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1922
            subdefmatch = pattern.match(escaped)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1923
            if subdefmatch:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1924
                break
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1925
            blockindex += 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1926
            try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1927
                escaped = escaped + ' ' + escape2null(block[blockindex].strip())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1928
            except IndexError:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1929
                raise MarkupError('malformed substitution definition.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1930
                                  lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1931
        del block[:blockindex]          # strip out the substitution marker
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1932
        block[0] = (block[0].strip() + ' ')[subdefmatch.end()-len(escaped)-1:-1]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1933
        if not block[0]:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1934
            del block[0]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1935
            offset += 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1936
        while block and not block[-1].strip():
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1937
            block.pop()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1938
        subname = subdefmatch.group('name')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1939
        substitution_node = nodes.substitution_definition(blocktext)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1940
        substitution_node.line = lineno
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1941
        if not block:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1942
            msg = self.reporter.warning(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1943
                'Substitution definition "%s" missing contents.' % subname,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1944
                nodes.literal_block(blocktext, blocktext), line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1945
            return [msg], blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1946
        block[0] = block[0].strip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1947
        substitution_node['names'].append(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1948
            nodes.whitespace_normalize_name(subname))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1949
        new_abs_offset, blank_finish = self.nested_list_parse(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1950
              block, input_offset=offset, node=substitution_node,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1951
              initial_state='SubstitutionDef', blank_finish=blank_finish)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1952
        i = 0
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1953
        for node in substitution_node[:]:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1954
            if not (isinstance(node, nodes.Inline) or
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1955
                    isinstance(node, nodes.Text)):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1956
                self.parent += substitution_node[i]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1957
                del substitution_node[i]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1958
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1959
                i += 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1960
        for node in substitution_node.traverse(nodes.Element):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1961
            if self.disallowed_inside_substitution_definitions(node):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1962
                pformat = nodes.literal_block('', node.pformat().rstrip())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1963
                msg = self.reporter.error(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1964
                    'Substitution definition contains illegal element:',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1965
                    pformat, nodes.literal_block(blocktext, blocktext),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1966
                    line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1967
                return [msg], blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1968
        if len(substitution_node) == 0:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1969
            msg = self.reporter.warning(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1970
                  'Substitution definition "%s" empty or invalid.'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1971
                  % subname,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1972
                  nodes.literal_block(blocktext, blocktext), line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1973
            return [msg], blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1974
        self.document.note_substitution_def(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1975
            substitution_node, subname, self.parent)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1976
        return [substitution_node], blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1977
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1978
    def disallowed_inside_substitution_definitions(self, node):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1979
        if (node['ids'] or
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1980
            isinstance(node, nodes.reference) and node.get('anonymous') or
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1981
            isinstance(node, nodes.footnote_reference) and node.get('auto')):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1982
            return 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1983
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1984
            return 0
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1985
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1986
    def directive(self, match, **option_presets):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1987
        """Returns a 2-tuple: list of nodes, and a "blank finish" boolean."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1988
        type_name = match.group(1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1989
        directive_class, messages = directives.directive(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1990
            type_name, self.memo.language, self.document)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1991
        self.parent += messages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1992
        if directive_class:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1993
            return self.run_directive(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1994
                directive_class, match, type_name, option_presets)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1995
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1996
            return self.unknown_directive(type_name)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1997
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1998
    def run_directive(self, directive, match, type_name, option_presets):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  1999
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2000
        Parse a directive then run its directive function.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2001
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2002
        Parameters:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2003
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2004
        - `directive`: The class implementing the directive.  Must be
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2005
          a subclass of `rst.Directive`.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2006
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2007
        - `match`: A regular expression match object which matched the first
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2008
          line of the directive.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2009
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2010
        - `type_name`: The directive name, as used in the source text.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2011
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2012
        - `option_presets`: A dictionary of preset options, defaults for the
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2013
          directive options.  Currently, only an "alt" option is passed by
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2014
          substitution definitions (value: the substitution name), which may
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2015
          be used by an embedded image directive.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2016
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2017
        Returns a 2-tuple: list of nodes, and a "blank finish" boolean.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2018
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2019
        if isinstance(directive, (FunctionType, MethodType)):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2020
            from docutils.parsers.rst import convert_directive_function
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2021
            directive = convert_directive_function(directive)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2022
        lineno = self.state_machine.abs_line_number()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2023
        initial_line_offset = self.state_machine.line_offset
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2024
        indented, indent, line_offset, blank_finish \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2025
                  = self.state_machine.get_first_known_indented(match.end(),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2026
                                                                strip_top=0)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2027
        block_text = '\n'.join(self.state_machine.input_lines[
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2028
            initial_line_offset : self.state_machine.line_offset + 1])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2029
        try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2030
            arguments, options, content, content_offset = (
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2031
                self.parse_directive_block(indented, line_offset,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2032
                                           directive, option_presets))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2033
        except MarkupError, detail:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2034
            error = self.reporter.error(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2035
                'Error in "%s" directive:\n%s.' % (type_name,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2036
                                                   ' '.join(detail.args)),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2037
                nodes.literal_block(block_text, block_text), line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2038
            return [error], blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2039
        directive_instance = directive(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2040
            type_name, arguments, options, content, lineno,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2041
            content_offset, block_text, self, self.state_machine)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2042
        try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2043
            result = directive_instance.run()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2044
        except docutils.parsers.rst.DirectiveError, directive_error:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2045
            msg_node = self.reporter.system_message(directive_error.level,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2046
                                                    directive_error.message)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2047
            msg_node += nodes.literal_block(block_text, block_text)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2048
            msg_node['line'] = lineno
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2049
            result = [msg_node]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2050
        assert isinstance(result, list), \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2051
               'Directive "%s" must return a list of nodes.' % type_name
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2052
        for i in range(len(result)):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2053
            assert isinstance(result[i], nodes.Node), \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2054
                   ('Directive "%s" returned non-Node object (index %s): %r'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2055
                    % (type_name, i, result[i]))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2056
        return (result,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2057
                blank_finish or self.state_machine.is_next_line_blank())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2058
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2059
    def parse_directive_block(self, indented, line_offset, directive,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2060
                              option_presets):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2061
        option_spec = directive.option_spec
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2062
        has_content = directive.has_content
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2063
        if indented and not indented[0].strip():
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2064
            indented.trim_start()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2065
            line_offset += 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2066
        while indented and not indented[-1].strip():
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2067
            indented.trim_end()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2068
        if indented and (directive.required_arguments
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2069
                         or directive.optional_arguments
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2070
                         or option_spec):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2071
            for i in range(len(indented)):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2072
                if not indented[i].strip():
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2073
                    break
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2074
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2075
                i += 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2076
            arg_block = indented[:i]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2077
            content = indented[i+1:]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2078
            content_offset = line_offset + i + 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2079
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2080
            content = indented
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2081
            content_offset = line_offset
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2082
            arg_block = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2083
        while content and not content[0].strip():
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2084
            content.trim_start()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2085
            content_offset += 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2086
        if option_spec:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2087
            options, arg_block = self.parse_directive_options(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2088
                option_presets, option_spec, arg_block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2089
            if arg_block and not (directive.required_arguments
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2090
                                  or directive.optional_arguments):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2091
                raise MarkupError('no arguments permitted; blank line '
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2092
                                  'required before content block')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2093
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2094
            options = {}
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2095
        if directive.required_arguments or directive.optional_arguments:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2096
            arguments = self.parse_directive_arguments(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2097
                directive, arg_block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2098
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2099
            arguments = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2100
        if content and not has_content:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2101
            raise MarkupError('no content permitted')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2102
        return (arguments, options, content, content_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2103
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2104
    def parse_directive_options(self, option_presets, option_spec, arg_block):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2105
        options = option_presets.copy()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2106
        for i in range(len(arg_block)):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2107
            if arg_block[i][:1] == ':':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2108
                opt_block = arg_block[i:]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2109
                arg_block = arg_block[:i]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2110
                break
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2111
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2112
            opt_block = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2113
        if opt_block:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2114
            success, data = self.parse_extension_options(option_spec,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2115
                                                         opt_block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2116
            if success:                 # data is a dict of options
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2117
                options.update(data)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2118
            else:                       # data is an error string
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2119
                raise MarkupError(data)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2120
        return options, arg_block
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2121
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2122
    def parse_directive_arguments(self, directive, arg_block):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2123
        required = directive.required_arguments
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2124
        optional = directive.optional_arguments
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2125
        arg_text = '\n'.join(arg_block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2126
        arguments = arg_text.split()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2127
        if len(arguments) < required:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2128
            raise MarkupError('%s argument(s) required, %s supplied'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2129
                              % (required, len(arguments)))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2130
        elif len(arguments) > required + optional:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2131
            if directive.final_argument_whitespace:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2132
                arguments = arg_text.split(None, required + optional - 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2133
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2134
                raise MarkupError(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2135
                    'maximum %s argument(s) allowed, %s supplied'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2136
                    % (required + optional, len(arguments)))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2137
        return arguments
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2138
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2139
    def parse_extension_options(self, option_spec, datalines):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2140
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2141
        Parse `datalines` for a field list containing extension options
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2142
        matching `option_spec`.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2143
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2144
        :Parameters:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2145
            - `option_spec`: a mapping of option name to conversion
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2146
              function, which should raise an exception on bad input.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2147
            - `datalines`: a list of input strings.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2148
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2149
        :Return:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2150
            - Success value, 1 or 0.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2151
            - An option dictionary on success, an error string on failure.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2152
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2153
        node = nodes.field_list()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2154
        newline_offset, blank_finish = self.nested_list_parse(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2155
              datalines, 0, node, initial_state='ExtensionOptions',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2156
              blank_finish=1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2157
        if newline_offset != len(datalines): # incomplete parse of block
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2158
            return 0, 'invalid option block'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2159
        try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2160
            options = utils.extract_extension_options(node, option_spec)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2161
        except KeyError, detail:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2162
            return 0, ('unknown option: "%s"' % detail.args[0])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2163
        except (ValueError, TypeError), detail:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2164
            return 0, ('invalid option value: %s' % ' '.join(detail.args))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2165
        except utils.ExtensionOptionError, detail:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2166
            return 0, ('invalid option data: %s' % ' '.join(detail.args))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2167
        if blank_finish:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2168
            return 1, options
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2169
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2170
            return 0, 'option data incompletely parsed'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2171
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2172
    def unknown_directive(self, type_name):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2173
        lineno = self.state_machine.abs_line_number()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2174
        indented, indent, offset, blank_finish = \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2175
              self.state_machine.get_first_known_indented(0, strip_indent=0)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2176
        text = '\n'.join(indented)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2177
        error = self.reporter.error(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2178
              'Unknown directive type "%s".' % type_name,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2179
              nodes.literal_block(text, text), line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2180
        return [error], blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2181
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2182
    def comment(self, match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2183
        if not match.string[match.end():].strip() \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2184
              and self.state_machine.is_next_line_blank(): # an empty comment?
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2185
            return [nodes.comment()], 1 # "A tiny but practical wart."
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2186
        indented, indent, offset, blank_finish = \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2187
              self.state_machine.get_first_known_indented(match.end())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2188
        while indented and not indented[-1].strip():
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2189
            indented.trim_end()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2190
        text = '\n'.join(indented)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2191
        return [nodes.comment(text, text)], blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2192
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2193
    explicit.constructs = [
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2194
          (footnote,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2195
           re.compile(r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2196
                      \.\.[ ]+          # explicit markup start
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2197
                      \[
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2198
                      (                 # footnote label:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2199
                          [0-9]+          # manually numbered footnote
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2200
                        |               # *OR*
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2201
                          \#              # anonymous auto-numbered footnote
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2202
                        |               # *OR*
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2203
                          \#%s            # auto-number ed?) footnote label
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2204
                        |               # *OR*
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2205
                          \*              # auto-symbol footnote
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2206
                      )
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2207
                      \]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2208
                      ([ ]+|$)          # whitespace or end of line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2209
                      """ % Inliner.simplename, re.VERBOSE | re.UNICODE)),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2210
          (citation,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2211
           re.compile(r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2212
                      \.\.[ ]+          # explicit markup start
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2213
                      \[(%s)\]          # citation label
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2214
                      ([ ]+|$)          # whitespace or end of line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2215
                      """ % Inliner.simplename, re.VERBOSE | re.UNICODE)),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2216
          (hyperlink_target,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2217
           re.compile(r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2218
                      \.\.[ ]+          # explicit markup start
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2219
                      _                 # target indicator
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2220
                      (?![ ]|$)         # first char. not space or EOL
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2221
                      """, re.VERBOSE)),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2222
          (substitution_def,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2223
           re.compile(r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2224
                      \.\.[ ]+          # explicit markup start
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2225
                      \|                # substitution indicator
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2226
                      (?![ ]|$)         # first char. not space or EOL
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2227
                      """, re.VERBOSE)),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2228
          (directive,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2229
           re.compile(r"""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2230
                      \.\.[ ]+          # explicit markup start
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2231
                      (%s)              # directive name
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2232
                      [ ]?              # optional space
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2233
                      ::                # directive delimiter
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2234
                      ([ ]+|$)          # whitespace or end of line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2235
                      """ % Inliner.simplename, re.VERBOSE | re.UNICODE))]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2236
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2237
    def explicit_markup(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2238
        """Footnotes, hyperlink targets, directives, comments."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2239
        nodelist, blank_finish = self.explicit_construct(match)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2240
        self.parent += nodelist
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2241
        self.explicit_list(blank_finish)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2242
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2243
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2244
    def explicit_construct(self, match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2245
        """Determine which explicit construct this is, parse & return it."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2246
        errors = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2247
        for method, pattern in self.explicit.constructs:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2248
            expmatch = pattern.match(match.string)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2249
            if expmatch:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2250
                try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2251
                    return method(self, expmatch)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2252
                except MarkupError, (message, lineno): # never reached?
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2253
                    errors.append(self.reporter.warning(message, line=lineno))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2254
                    break
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2255
        nodelist, blank_finish = self.comment(match)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2256
        return nodelist + errors, blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2257
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2258
    def explicit_list(self, blank_finish):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2259
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2260
        Create a nested state machine for a series of explicit markup
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2261
        constructs (including anonymous hyperlink targets).
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2262
        """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2263
        offset = self.state_machine.line_offset + 1   # next line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2264
        newline_offset, blank_finish = self.nested_list_parse(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2265
              self.state_machine.input_lines[offset:],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2266
              input_offset=self.state_machine.abs_line_offset() + 1,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2267
              node=self.parent, initial_state='Explicit',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2268
              blank_finish=blank_finish,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2269
              match_titles=self.state_machine.match_titles)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2270
        self.goto_line(newline_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2271
        if not blank_finish:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2272
            self.parent += self.unindent_warning('Explicit markup')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2273
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2274
    def anonymous(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2275
        """Anonymous hyperlink targets."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2276
        nodelist, blank_finish = self.anonymous_target(match)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2277
        self.parent += nodelist
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2278
        self.explicit_list(blank_finish)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2279
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2280
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2281
    def anonymous_target(self, match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2282
        lineno = self.state_machine.abs_line_number()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2283
        block, indent, offset, blank_finish \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2284
              = self.state_machine.get_first_known_indented(match.end(),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2285
                                                            until_blank=1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2286
        blocktext = match.string[:match.end()] + '\n'.join(block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2287
        block = [escape2null(line) for line in block]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2288
        target = self.make_target(block, blocktext, lineno, '')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2289
        return [target], blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2290
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2291
    def line(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2292
        """Section title overline or transition marker."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2293
        if self.state_machine.match_titles:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2294
            return [match.string], 'Line', []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2295
        elif match.string.strip() == '::':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2296
            raise statemachine.TransitionCorrection('text')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2297
        elif len(match.string.strip()) < 4:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2298
            msg = self.reporter.info(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2299
                'Unexpected possible title overline or transition.\n'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2300
                "Treating it as ordinary text because it's so short.",
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2301
                line=self.state_machine.abs_line_number())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2302
            self.parent += msg
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2303
            raise statemachine.TransitionCorrection('text')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2304
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2305
            blocktext = self.state_machine.line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2306
            msg = self.reporter.severe(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2307
                  'Unexpected section title or transition.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2308
                  nodes.literal_block(blocktext, blocktext),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2309
                  line=self.state_machine.abs_line_number())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2310
            self.parent += msg
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2311
            return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2312
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2313
    def text(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2314
        """Titles, definition lists, paragraphs."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2315
        return [match.string], 'Text', []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2316
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2317
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2318
class RFC2822Body(Body):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2319
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2320
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2321
    RFC2822 headers are only valid as the first constructs in documents.  As
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2322
    soon as anything else appears, the `Body` state should take over.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2323
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2324
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2325
    patterns = Body.patterns.copy()     # can't modify the original
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2326
    patterns['rfc2822'] = r'[!-9;-~]+:( +|$)'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2327
    initial_transitions = [(name, 'Body')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2328
                           for name in Body.initial_transitions]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2329
    initial_transitions.insert(-1, ('rfc2822', 'Body')) # just before 'text'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2330
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2331
    def rfc2822(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2332
        """RFC2822-style field list item."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2333
        fieldlist = nodes.field_list(classes=['rfc2822'])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2334
        self.parent += fieldlist
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2335
        field, blank_finish = self.rfc2822_field(match)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2336
        fieldlist += field
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2337
        offset = self.state_machine.line_offset + 1   # next line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2338
        newline_offset, blank_finish = self.nested_list_parse(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2339
              self.state_machine.input_lines[offset:],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2340
              input_offset=self.state_machine.abs_line_offset() + 1,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2341
              node=fieldlist, initial_state='RFC2822List',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2342
              blank_finish=blank_finish)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2343
        self.goto_line(newline_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2344
        if not blank_finish:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2345
            self.parent += self.unindent_warning(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2346
                  'RFC2822-style field list')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2347
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2348
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2349
    def rfc2822_field(self, match):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2350
        name = match.string[:match.string.find(':')]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2351
        indented, indent, line_offset, blank_finish = \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2352
              self.state_machine.get_first_known_indented(match.end(),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2353
                                                          until_blank=1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2354
        fieldnode = nodes.field()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2355
        fieldnode += nodes.field_name(name, name)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2356
        fieldbody = nodes.field_body('\n'.join(indented))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2357
        fieldnode += fieldbody
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2358
        if indented:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2359
            self.nested_parse(indented, input_offset=line_offset,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2360
                              node=fieldbody)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2361
        return fieldnode, blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2362
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2363
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2364
class SpecializedBody(Body):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2365
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2366
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2367
    Superclass for second and subsequent compound element members.  Compound
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2368
    elements are lists and list-like constructs.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2369
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2370
    All transition methods are disabled (redefined as `invalid_input`).
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2371
    Override individual methods in subclasses to re-enable.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2372
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2373
    For example, once an initial bullet list item, say, is recognized, the
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2374
    `BulletList` subclass takes over, with a "bullet_list" node as its
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2375
    container.  Upon encountering the initial bullet list item, `Body.bullet`
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2376
    calls its ``self.nested_list_parse`` (`RSTState.nested_list_parse`), which
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2377
    starts up a nested parsing session with `BulletList` as the initial state.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2378
    Only the ``bullet`` transition method is enabled in `BulletList`; as long
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2379
    as only bullet list items are encountered, they are parsed and inserted
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2380
    into the container.  The first construct which is *not* a bullet list item
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2381
    triggers the `invalid_input` method, which ends the nested parse and
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2382
    closes the container.  `BulletList` needs to recognize input that is
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2383
    invalid in the context of a bullet list, which means everything *other
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2384
    than* bullet list items, so it inherits the transition list created in
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2385
    `Body`.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2386
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2387
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2388
    def invalid_input(self, match=None, context=None, next_state=None):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2389
        """Not a compound element member. Abort this state machine."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2390
        self.state_machine.previous_line() # back up so parent SM can reassess
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2391
        raise EOFError
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2392
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2393
    indent = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2394
    bullet = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2395
    enumerator = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2396
    field_marker = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2397
    option_marker = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2398
    doctest = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2399
    line_block = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2400
    grid_table_top = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2401
    simple_table_top = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2402
    explicit_markup = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2403
    anonymous = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2404
    line = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2405
    text = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2406
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2407
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2408
class BulletList(SpecializedBody):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2409
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2410
    """Second and subsequent bullet_list list_items."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2411
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2412
    def bullet(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2413
        """Bullet list item."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2414
        if match.string[0] != self.parent['bullet']:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2415
            # different bullet: new list
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2416
            self.invalid_input()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2417
        listitem, blank_finish = self.list_item(match.end())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2418
        self.parent += listitem
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2419
        self.blank_finish = blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2420
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2421
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2422
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2423
class DefinitionList(SpecializedBody):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2424
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2425
    """Second and subsequent definition_list_items."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2426
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2427
    def text(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2428
        """Definition lists."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2429
        return [match.string], 'Definition', []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2430
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2431
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2432
class EnumeratedList(SpecializedBody):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2433
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2434
    """Second and subsequent enumerated_list list_items."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2435
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2436
    def enumerator(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2437
        """Enumerated list item."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2438
        format, sequence, text, ordinal = self.parse_enumerator(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2439
              match, self.parent['enumtype'])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2440
        if ( format != self.format
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2441
             or (sequence != '#' and (sequence != self.parent['enumtype']
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2442
                                      or self.auto
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2443
                                      or ordinal != (self.lastordinal + 1)))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2444
             or not self.is_enumerated_list_item(ordinal, sequence, format)):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2445
            # different enumeration: new list
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2446
            self.invalid_input()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2447
        if sequence == '#':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2448
            self.auto = 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2449
        listitem, blank_finish = self.list_item(match.end())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2450
        self.parent += listitem
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2451
        self.blank_finish = blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2452
        self.lastordinal = ordinal
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2453
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2454
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2455
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2456
class FieldList(SpecializedBody):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2457
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2458
    """Second and subsequent field_list fields."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2459
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2460
    def field_marker(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2461
        """Field list field."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2462
        field, blank_finish = self.field(match)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2463
        self.parent += field
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2464
        self.blank_finish = blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2465
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2466
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2467
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2468
class OptionList(SpecializedBody):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2469
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2470
    """Second and subsequent option_list option_list_items."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2471
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2472
    def option_marker(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2473
        """Option list item."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2474
        try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2475
            option_list_item, blank_finish = self.option_list_item(match)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2476
        except MarkupError, (message, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2477
            self.invalid_input()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2478
        self.parent += option_list_item
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2479
        self.blank_finish = blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2480
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2481
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2482
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2483
class RFC2822List(SpecializedBody, RFC2822Body):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2484
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2485
    """Second and subsequent RFC2822-style field_list fields."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2486
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2487
    patterns = RFC2822Body.patterns
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2488
    initial_transitions = RFC2822Body.initial_transitions
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2489
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2490
    def rfc2822(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2491
        """RFC2822-style field list item."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2492
        field, blank_finish = self.rfc2822_field(match)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2493
        self.parent += field
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2494
        self.blank_finish = blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2495
        return [], 'RFC2822List', []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2496
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2497
    blank = SpecializedBody.invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2498
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2499
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2500
class ExtensionOptions(FieldList):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2501
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2502
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2503
    Parse field_list fields for extension options.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2504
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2505
    No nested parsing is done (including inline markup parsing).
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2506
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2507
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2508
    def parse_field_body(self, indented, offset, node):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2509
        """Override `Body.parse_field_body` for simpler parsing."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2510
        lines = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2511
        for line in list(indented) + ['']:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2512
            if line.strip():
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2513
                lines.append(line)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2514
            elif lines:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2515
                text = '\n'.join(lines)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2516
                node += nodes.paragraph(text, text)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2517
                lines = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2518
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2519
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2520
class LineBlock(SpecializedBody):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2521
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2522
    """Second and subsequent lines of a line_block."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2523
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2524
    blank = SpecializedBody.invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2525
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2526
    def line_block(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2527
        """New line of line block."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2528
        lineno = self.state_machine.abs_line_number()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2529
        line, messages, blank_finish = self.line_block_line(match, lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2530
        self.parent += line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2531
        self.parent.parent += messages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2532
        self.blank_finish = blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2533
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2534
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2535
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2536
class Explicit(SpecializedBody):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2537
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2538
    """Second and subsequent explicit markup construct."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2539
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2540
    def explicit_markup(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2541
        """Footnotes, hyperlink targets, directives, comments."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2542
        nodelist, blank_finish = self.explicit_construct(match)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2543
        self.parent += nodelist
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2544
        self.blank_finish = blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2545
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2546
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2547
    def anonymous(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2548
        """Anonymous hyperlink targets."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2549
        nodelist, blank_finish = self.anonymous_target(match)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2550
        self.parent += nodelist
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2551
        self.blank_finish = blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2552
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2553
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2554
    blank = SpecializedBody.invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2555
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2556
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2557
class SubstitutionDef(Body):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2558
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2559
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2560
    Parser for the contents of a substitution_definition element.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2561
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2562
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2563
    patterns = {
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2564
          'embedded_directive': re.compile(r'(%s)::( +|$)'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2565
                                           % Inliner.simplename, re.UNICODE),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2566
          'text': r''}
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2567
    initial_transitions = ['embedded_directive', 'text']
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2568
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2569
    def embedded_directive(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2570
        nodelist, blank_finish = self.directive(match,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2571
                                                alt=self.parent['names'][0])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2572
        self.parent += nodelist
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2573
        if not self.state_machine.at_eof():
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2574
            self.blank_finish = blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2575
        raise EOFError
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2576
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2577
    def text(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2578
        if not self.state_machine.at_eof():
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2579
            self.blank_finish = self.state_machine.is_next_line_blank()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2580
        raise EOFError
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2581
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2582
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2583
class Text(RSTState):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2584
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2585
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2586
    Classifier of second line of a text block.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2587
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2588
    Could be a paragraph, a definition list item, or a title.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2589
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2590
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2591
    patterns = {'underline': Body.patterns['line'],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2592
                'text': r''}
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2593
    initial_transitions = [('underline', 'Body'), ('text', 'Body')]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2594
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2595
    def blank(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2596
        """End of paragraph."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2597
        paragraph, literalnext = self.paragraph(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2598
              context, self.state_machine.abs_line_number() - 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2599
        self.parent += paragraph
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2600
        if literalnext:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2601
            self.parent += self.literal_block()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2602
        return [], 'Body', []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2603
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2604
    def eof(self, context):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2605
        if context:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2606
            self.blank(None, context, None)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2607
        return []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2608
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2609
    def indent(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2610
        """Definition list item."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2611
        definitionlist = nodes.definition_list()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2612
        definitionlistitem, blank_finish = self.definition_list_item(context)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2613
        definitionlist += definitionlistitem
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2614
        self.parent += definitionlist
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2615
        offset = self.state_machine.line_offset + 1   # next line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2616
        newline_offset, blank_finish = self.nested_list_parse(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2617
              self.state_machine.input_lines[offset:],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2618
              input_offset=self.state_machine.abs_line_offset() + 1,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2619
              node=definitionlist, initial_state='DefinitionList',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2620
              blank_finish=blank_finish, blank_finish_state='Definition')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2621
        self.goto_line(newline_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2622
        if not blank_finish:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2623
            self.parent += self.unindent_warning('Definition list')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2624
        return [], 'Body', []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2625
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2626
    def underline(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2627
        """Section title."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2628
        lineno = self.state_machine.abs_line_number()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2629
        title = context[0].rstrip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2630
        underline = match.string.rstrip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2631
        source = title + '\n' + underline
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2632
        messages = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2633
        if column_width(title) > len(underline):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2634
            if len(underline) < 4:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2635
                if self.state_machine.match_titles:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2636
                    msg = self.reporter.info(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2637
                        'Possible title underline, too short for the title.\n'
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2638
                        "Treating it as ordinary text because it's so short.",
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2639
                        line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2640
                    self.parent += msg
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2641
                raise statemachine.TransitionCorrection('text')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2642
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2643
                blocktext = context[0] + '\n' + self.state_machine.line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2644
                msg = self.reporter.warning(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2645
                    'Title underline too short.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2646
                    nodes.literal_block(blocktext, blocktext), line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2647
                messages.append(msg)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2648
        if not self.state_machine.match_titles:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2649
            blocktext = context[0] + '\n' + self.state_machine.line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2650
            msg = self.reporter.severe(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2651
                'Unexpected section title.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2652
                nodes.literal_block(blocktext, blocktext), line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2653
            self.parent += messages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2654
            self.parent += msg
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2655
            return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2656
        style = underline[0]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2657
        context[:] = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2658
        self.section(title, source, style, lineno - 1, messages)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2659
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2660
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2661
    def text(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2662
        """Paragraph."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2663
        startline = self.state_machine.abs_line_number() - 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2664
        msg = None
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2665
        try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2666
            block = self.state_machine.get_text_block(flush_left=1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2667
        except statemachine.UnexpectedIndentationError, instance:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2668
            block, source, lineno = instance.args
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2669
            msg = self.reporter.error('Unexpected indentation.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2670
                                      source=source, line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2671
        lines = context + list(block)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2672
        paragraph, literalnext = self.paragraph(lines, startline)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2673
        self.parent += paragraph
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2674
        self.parent += msg
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2675
        if literalnext:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2676
            try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2677
                self.state_machine.next_line()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2678
            except EOFError:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2679
                pass
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2680
            self.parent += self.literal_block()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2681
        return [], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2682
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2683
    def literal_block(self):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2684
        """Return a list of nodes."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2685
        indented, indent, offset, blank_finish = \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2686
              self.state_machine.get_indented()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2687
        while indented and not indented[-1].strip():
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2688
            indented.trim_end()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2689
        if not indented:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2690
            return self.quoted_literal_block()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2691
        data = '\n'.join(indented)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2692
        literal_block = nodes.literal_block(data, data)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2693
        literal_block.line = offset + 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2694
        nodelist = [literal_block]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2695
        if not blank_finish:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2696
            nodelist.append(self.unindent_warning('Literal block'))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2697
        return nodelist
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2698
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2699
    def quoted_literal_block(self):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2700
        abs_line_offset = self.state_machine.abs_line_offset()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2701
        offset = self.state_machine.line_offset
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2702
        parent_node = nodes.Element()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2703
        new_abs_offset = self.nested_parse(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2704
            self.state_machine.input_lines[offset:],
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2705
            input_offset=abs_line_offset, node=parent_node, match_titles=0,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2706
            state_machine_kwargs={'state_classes': (QuotedLiteralBlock,),
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2707
                                  'initial_state': 'QuotedLiteralBlock'})
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2708
        self.goto_line(new_abs_offset)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2709
        return parent_node.children
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2710
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2711
    def definition_list_item(self, termline):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2712
        indented, indent, line_offset, blank_finish = \
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2713
              self.state_machine.get_indented()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2714
        definitionlistitem = nodes.definition_list_item(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2715
            '\n'.join(termline + list(indented)))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2716
        lineno = self.state_machine.abs_line_number() - 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2717
        definitionlistitem.line = lineno
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2718
        termlist, messages = self.term(termline, lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2719
        definitionlistitem += termlist
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2720
        definition = nodes.definition('', *messages)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2721
        definitionlistitem += definition
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2722
        if termline[0][-2:] == '::':
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2723
            definition += self.reporter.info(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2724
                  'Blank line missing before literal block (after the "::")? '
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2725
                  'Interpreted as a definition list item.', line=line_offset+1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2726
        self.nested_parse(indented, input_offset=line_offset, node=definition)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2727
        return definitionlistitem, blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2728
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2729
    classifier_delimiter = re.compile(' +: +')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2730
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2731
    def term(self, lines, lineno):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2732
        """Return a definition_list's term and optional classifiers."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2733
        assert len(lines) == 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2734
        text_nodes, messages = self.inline_text(lines[0], lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2735
        term_node = nodes.term()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2736
        node_list = [term_node]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2737
        for i in range(len(text_nodes)):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2738
            node = text_nodes[i]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2739
            if isinstance(node, nodes.Text):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2740
                parts = self.classifier_delimiter.split(node.rawsource)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2741
                if len(parts) == 1:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2742
                    node_list[-1] += node
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2743
                else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2744
                    
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2745
                    node_list[-1] += nodes.Text(parts[0].rstrip())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2746
                    for part in parts[1:]:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2747
                        classifier_node = nodes.classifier('', part)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2748
                        node_list.append(classifier_node)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2749
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2750
                node_list[-1] += node
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2751
        return node_list, messages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2752
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2753
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2754
class SpecializedText(Text):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2755
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2756
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2757
    Superclass for second and subsequent lines of Text-variants.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2758
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2759
    All transition methods are disabled. Override individual methods in
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2760
    subclasses to re-enable.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2761
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2762
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2763
    def eof(self, context):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2764
        """Incomplete construct."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2765
        return []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2766
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2767
    def invalid_input(self, match=None, context=None, next_state=None):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2768
        """Not a compound element member. Abort this state machine."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2769
        raise EOFError
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2770
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2771
    blank = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2772
    indent = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2773
    underline = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2774
    text = invalid_input
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2775
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2776
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2777
class Definition(SpecializedText):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2778
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2779
    """Second line of potential definition_list_item."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2780
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2781
    def eof(self, context):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2782
        """Not a definition."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2783
        self.state_machine.previous_line(2) # so parent SM can reassess
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2784
        return []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2785
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2786
    def indent(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2787
        """Definition list item."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2788
        definitionlistitem, blank_finish = self.definition_list_item(context)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2789
        self.parent += definitionlistitem
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2790
        self.blank_finish = blank_finish
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2791
        return [], 'DefinitionList', []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2792
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2793
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2794
class Line(SpecializedText):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2795
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2796
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2797
    Second line of over- & underlined section title or transition marker.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2798
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2799
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2800
    eofcheck = 1                        # @@@ ???
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2801
    """Set to 0 while parsing sections, so that we don't catch the EOF."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2802
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2803
    def eof(self, context):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2804
        """Transition marker at end of section or document."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2805
        marker = context[0].strip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2806
        if self.memo.section_bubble_up_kludge:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2807
            self.memo.section_bubble_up_kludge = 0
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2808
        elif len(marker) < 4:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2809
            self.state_correction(context)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2810
        if self.eofcheck:               # ignore EOFError with sections
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2811
            lineno = self.state_machine.abs_line_number() - 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2812
            transition = nodes.transition(rawsource=context[0])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2813
            transition.line = lineno
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2814
            self.parent += transition
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2815
        self.eofcheck = 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2816
        return []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2817
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2818
    def blank(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2819
        """Transition marker."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2820
        lineno = self.state_machine.abs_line_number() - 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2821
        marker = context[0].strip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2822
        if len(marker) < 4:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2823
            self.state_correction(context)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2824
        transition = nodes.transition(rawsource=marker)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2825
        transition.line = lineno
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2826
        self.parent += transition
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2827
        return [], 'Body', []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2828
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2829
    def text(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2830
        """Potential over- & underlined title."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2831
        lineno = self.state_machine.abs_line_number() - 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2832
        overline = context[0]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2833
        title = match.string
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2834
        underline = ''
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2835
        try:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2836
            underline = self.state_machine.next_line()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2837
        except EOFError:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2838
            blocktext = overline + '\n' + title
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2839
            if len(overline.rstrip()) < 4:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2840
                self.short_overline(context, blocktext, lineno, 2)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2841
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2842
                msg = self.reporter.severe(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2843
                    'Incomplete section title.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2844
                    nodes.literal_block(blocktext, blocktext), line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2845
                self.parent += msg
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2846
                return [], 'Body', []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2847
        source = '%s\n%s\n%s' % (overline, title, underline)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2848
        overline = overline.rstrip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2849
        underline = underline.rstrip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2850
        if not self.transitions['underline'][0].match(underline):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2851
            blocktext = overline + '\n' + title + '\n' + underline
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2852
            if len(overline.rstrip()) < 4:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2853
                self.short_overline(context, blocktext, lineno, 2)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2854
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2855
                msg = self.reporter.severe(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2856
                    'Missing matching underline for section title overline.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2857
                    nodes.literal_block(source, source), line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2858
                self.parent += msg
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2859
                return [], 'Body', []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2860
        elif overline != underline:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2861
            blocktext = overline + '\n' + title + '\n' + underline
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2862
            if len(overline.rstrip()) < 4:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2863
                self.short_overline(context, blocktext, lineno, 2)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2864
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2865
                msg = self.reporter.severe(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2866
                      'Title overline & underline mismatch.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2867
                      nodes.literal_block(source, source), line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2868
                self.parent += msg
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2869
                return [], 'Body', []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2870
        title = title.rstrip()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2871
        messages = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2872
        if column_width(title) > len(overline):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2873
            blocktext = overline + '\n' + title + '\n' + underline
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2874
            if len(overline.rstrip()) < 4:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2875
                self.short_overline(context, blocktext, lineno, 2)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2876
            else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2877
                msg = self.reporter.warning(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2878
                      'Title overline too short.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2879
                      nodes.literal_block(source, source), line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2880
                messages.append(msg)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2881
        style = (overline[0], underline[0])
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2882
        self.eofcheck = 0               # @@@ not sure this is correct
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2883
        self.section(title.lstrip(), source, style, lineno + 1, messages)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2884
        self.eofcheck = 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2885
        return [], 'Body', []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2886
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2887
    indent = text                       # indented title
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2888
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2889
    def underline(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2890
        overline = context[0]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2891
        blocktext = overline + '\n' + self.state_machine.line
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2892
        lineno = self.state_machine.abs_line_number() - 1
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2893
        if len(overline.rstrip()) < 4:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2894
            self.short_overline(context, blocktext, lineno, 1)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2895
        msg = self.reporter.error(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2896
              'Invalid section title or transition marker.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2897
              nodes.literal_block(blocktext, blocktext), line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2898
        self.parent += msg
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2899
        return [], 'Body', []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2900
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2901
    def short_overline(self, context, blocktext, lineno, lines=1):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2902
        msg = self.reporter.info(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2903
            'Possible incomplete section title.\nTreating the overline as '
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2904
            "ordinary text because it's so short.", line=lineno)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2905
        self.parent += msg
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2906
        self.state_correction(context, lines)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2907
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2908
    def state_correction(self, context, lines=1):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2909
        self.state_machine.previous_line(lines)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2910
        context[:] = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2911
        raise statemachine.StateCorrection('Body', 'text')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2912
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2913
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2914
class QuotedLiteralBlock(RSTState):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2915
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2916
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2917
    Nested parse handler for quoted (unindented) literal blocks.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2918
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2919
    Special-purpose.  Not for inclusion in `state_classes`.
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2920
    """
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2921
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2922
    patterns = {'initial_quoted': r'(%(nonalphanum7bit)s)' % Body.pats,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2923
                'text': r''}
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2924
    initial_transitions = ('initial_quoted', 'text')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2925
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2926
    def __init__(self, state_machine, debug=0):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2927
        RSTState.__init__(self, state_machine, debug)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2928
        self.messages = []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2929
        self.initial_lineno = None
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2930
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2931
    def blank(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2932
        if context:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2933
            raise EOFError
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2934
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2935
            return context, next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2936
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2937
    def eof(self, context):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2938
        if context:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2939
            text = '\n'.join(context)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2940
            literal_block = nodes.literal_block(text, text)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2941
            literal_block.line = self.initial_lineno
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2942
            self.parent += literal_block
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2943
        else:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2944
            self.parent += self.reporter.warning(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2945
                'Literal block expected; none found.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2946
                line=self.state_machine.abs_line_number())
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2947
            self.state_machine.previous_line()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2948
        self.parent += self.messages
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2949
        return []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2950
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2951
    def indent(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2952
        assert context, ('QuotedLiteralBlock.indent: context should not '
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2953
                         'be empty!')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2954
        self.messages.append(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2955
            self.reporter.error('Unexpected indentation.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2956
                                line=self.state_machine.abs_line_number()))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2957
        self.state_machine.previous_line()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2958
        raise EOFError
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2959
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2960
    def initial_quoted(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2961
        """Match arbitrary quote character on the first line only."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2962
        self.remove_transition('initial_quoted')
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2963
        quote = match.string[0]
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2964
        pattern = re.compile(re.escape(quote))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2965
        # New transition matches consistent quotes only:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2966
        self.add_transition('quoted',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2967
                            (pattern, self.quoted, self.__class__.__name__))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2968
        self.initial_lineno = self.state_machine.abs_line_number()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2969
        return [match.string], next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2970
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2971
    def quoted(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2972
        """Match consistent quotes on subsequent lines."""
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2973
        context.append(match.string)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2974
        return context, next_state, []
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2975
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2976
    def text(self, match, context, next_state):
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2977
        if context:
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2978
            self.messages.append(
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2979
                self.reporter.error('Inconsistent literal block quoting.',
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2980
                                    line=self.state_machine.abs_line_number()))
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2981
            self.state_machine.previous_line()
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2982
        raise EOFError
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2983
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2984
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2985
state_classes = (Body, BulletList, DefinitionList, EnumeratedList, FieldList,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2986
                 OptionList, LineBlock, ExtensionOptions, Explicit, Text,
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2987
                 Definition, Line, SubstitutionDef, RFC2822Body, RFC2822List)
d8ac696cc51f helium_7.0-r14027
wbernard
parents:
diff changeset
  2988
"""Standard set of State classes used to start `RSTStateMachine`."""