aboutsummaryrefslogtreecommitdiffstats
path: root/docs/scripting/inlinescripts.rst
blob: 19e175824ebc3a70fc07188b98373d9049a6d2c6 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
.. _inlinescripts:

Inline Scripts
==============

**mitmproxy** has a powerful scripting API that allows you to modify flows
on-the-fly or rewrite previously saved flows locally.

The mitmproxy scripting API is event driven - a script is simply a Python
module that exposes a set of event methods. Here's a complete mitmproxy script
that adds a new header to every HTTP response before it is returned to the
client:

.. literalinclude:: ../../examples/add_header.py
   :caption: examples/add_header.py
   :language: python

The first argument to each event method is an instance of
:py:class:`~libmproxy.script.ScriptContext` that lets the script interact with the global mitmproxy
state. The **response** event also gets an instance of :py:class:`~libmproxy.script.ScriptContext`,
which we can use to manipulate the response itself.

We can now run this script using mitmdump or mitmproxy as follows:

>>> mitmdump -s add_header.py

The new header will be added to all responses passing through the proxy.

Examples
--------

mitmproxy comes with a variety of example inline scripts, which demonstrate many basic tasks.
We encourage you to either browse them locally or on `GitHub`_.


Events
------

.. TODO: Split this into Connection, HTTP and TCP events once we have TCP events.

The ``context`` argument passed to each event method is always a
:py:class:`~libmproxy.script.ScriptContext` instance. It is guaranteed to be the same object
for the scripts lifetime and is not shared between multiple inline scripts. You can safely use it
to store any form of state you require.

Events are listed in the order they usually occur.

.. py:function:: start(context, argv)

    Called once on startup, before any other events.

    :param List[str] argv: The inline scripts' arguments.
        For example, ``mitmproxy -s 'example.py --foo 42'`` sets argv to ``["--foo", "42"]``.

.. py:function:: clientconnect(context, root_layer)

    Called when a client initiates a connection to the proxy. Note that
    a connection can correspond to multiple HTTP requests.

    .. versionchanged:: 0.14

    :param Layer root_layer: The root layer (see :ref:`protocols` for an explanation what the root
        layer is), which provides transparent access to all attributes of the
        :py:class:`~libmproxy.proxy.RootContext`. For example, ``root_layer.client_conn.address``
        gives the remote address of the connecting client.


.. py:function:: request(context, flow)

    Called when a client request has been received. The ``flow`` object is
    guaranteed to have a non-None ``request`` attribute.

    :param HTTPFlow flow: The flow containing the request which has been received.
        The object is guaranteed to have a non-None ``request`` attribute.

.. py:function:: serverconnect(context, server_conn)

    Called before the proxy initiates a connection to the target server. Note that
    a connection can correspond to multiple HTTP requests.

    :param ServerConnection server_conn: The server connection object. It is guaranteed to have a
        non-None ``address`` attribute.

.. py:function:: responseheaders(context, flow)

    Called when the headers of a server response have been received.
    This will always be called before the response hook.

    :param HTTPFlow flow: The flow containing the request and response.
        The object is guaranteed to have non-None ``request`` and
        ``response`` attributes. ``response.content`` will be ``None``,
        as the response body has not been read yet.

.. py:function:: response(context, flow)

    Called when a server response has been received.

    :param HTTPFlow flow: The flow containing the request and response.
        The object is guaranteed to have non-None ``request`` and
        ``response`` attributes. ``response.body`` will contain the raw response body,
        unless response streaming has been enabled.

.. py:function:: error(context, flow)

    Called when a flow error has occurred, e.g. invalid server responses, or
    interrupted connections. This is distinct from a valid server HTTP error
    response, which is simply a response with an HTTP error code.

    :param HTTPFlow flow: The flow containing the error.
        It is guaranteed to have non-None ``error`` attribute.

.. py:function:: serverdisconnect(context, server_conn)

    Called when the proxy has closed the server connection.

    .. versionadded:: 0.14

    :param ServerConnection server_conn: see :py:func:`serverconnect`

.. py:function:: clientdisconnect(context, root_layer)

    Called when a client disconnects from the proxy.

    .. versionchanged:: 0.14

    :param Layer root_layer: see :py:func:`clientconnect`

.. py:function:: done(context)

    Called once on script shutdown, after any other events.


API
---

The canonical API documentation is the code, which you can browse here, locally or on `GitHub`_.
*Use the Source, Luke!*

The main classes you will deal with in writing mitmproxy scripts are:

:py:class:`~libmproxy.script.ScriptContext`
    - A handle for interacting with mitmproxy's Flow Master from within scripts.
:py:class:`~libmproxy.models.ClientConnection`
    - Describes a client connection.
:py:class:`~libmproxy.models.ServerConnection`
    - Describes a server connection.
:py:class:`~libmproxy.models.HTTPFlow`
    - A collection of objects representing a single HTTP transaction.
:py:class:`~libmproxy.models.HTTPRequest`
    - An HTTP request.
:py:class:`~libmproxy.models.HTTPResponse`
    - An HTTP response.
:py:class:`~libmproxy.models.Error`
    - A communications error.
:py:class:`netlib.http.Headers`
    - A dictionary-like object for managing HTTP headers.
:py:class:`netlib.certutils.SSLCert`
    - Exposes information SSL certificates.
:py:class:`libmproxy.flow.FlowMaster`
    - The "heart" of libmproxy, usually subclassed as :py:class:`libmproxy.dump.DumpMaster` or
      :py:class:`libmproxy.console.ConsoleMaster`.

Script Context
--------------

.. autoclass:: libmproxy.script.ScriptContext
    :members:
    :undoc-members:

Running scripts in parallel
---------------------------

We have a single flow primitive, so when a script is blocking, other requests are not processed.
While that's usually a very desirable behaviour, blocking scripts can be run threaded by using the
:py:obj:`libmproxy.script.concurrent` decorator.
**If your script does not block, you should avoid the overhead of the decorator.**

.. literalinclude:: ../../examples/nonblocking.py
   :caption: examples/nonblocking.py
   :language: python

Make scripts configurable with arguments
----------------------------------------

Sometimes, you want to pass runtime arguments to the inline script. This can be simply done by
surrounding the script call with quotes, e.g. ```mitmdump -s 'script.py --foo 42'``.
The arguments are then exposed in the start event:

.. literalinclude:: ../../examples/modify_response_body.py
   :caption: examples/modify_response_body.py
   :language: python

Running scripts on saved flows
------------------------------

Sometimes, we want to run a script on :py:class:`~libmproxy.models.Flow` objects that are already
complete.  This happens when you start a script, and then load a saved set of flows from a file
(see the "scripted data transformation" example `here <https://mitmproxy.org/doc/mitmdump.html>`_).
It also happens when you run a one-shot script on a single flow through the ``|`` (pipe) shortcut
in mitmproxy.

In this case, there are no client connections, and the events are run in the following order:
**start**, **request**, **responseheaders**, **response**, **error**, **done**.
If the flow doesn't have a **response** or **error** associated with it, the matching events will
be skipped.

Spaces in the script path
-------------------------

By default, spaces are interpreted as a separator between the inline script and its arguments
(e.g. ``-s 'foo.py 42'``). Consequently, the script path needs to be wrapped in a separate pair of
quotes if it contains spaces: ``-s '\'./foo bar/baz.py\' 42'``.

.. _GitHub: https://github.com/mitmproxy/mitmproxy