blob: 7d90642a9e582c487e21818445bb0045905c341e [file] [log] [blame] [view]
pwnall4ea2eb32016-11-29 02:47:251# Writing Layout Tests
2
3_Layout tests_ is a bit of a misnomer. This term is
4[a part of our WebKit heritage](https://webkit.org/blog/1452/layout-tests-theory/),
5and we use it to refer to every test that is written as a Web page (HTML, SVG,
6or XHTML) and lives in
7[third_party/WebKit/LayoutTests/](../../third_party/WebKit/LayoutTests).
8
9[TOC]
10
11## Overview
12
13Layout tests should be used to accomplish one of the following goals:
14
151. The entire surface of Blink that is exposed to the Web should be covered by
foolipeda32ab2017-02-16 19:21:5816 tests that we contribute to [web-platform-tests](./web_platform_tests.md)
17 (WPT). This helps us avoid regressions, and helps us identify Web Platform
18 areas where the major browsers don't have interoperable implementations.
19 Furthermore, by contributing to projects such as WPT, we share the burden of
20 writing tests with the other browser vendors, and we help all the browsers
21 get better. This is very much in line with our goal to move the Web forward.
pwnall4ea2eb32016-11-29 02:47:25222. When a Blink feature cannot be tested using the tools provided by WPT, and
23 cannot be easily covered by
Kent Tamura6943cf792018-04-09 05:24:5424 [C++ unit tests](https://cs.chromium.org/chromium/src/third_party/blink/renderer/web/tests/?q=webframetest&sq=package:chromium&type=cs),
pwnall4ea2eb32016-11-29 02:47:2525 the feature must be covered by layout tests, to avoid unexpected regressions.
26 These tests will use Blink-specific testing APIs that are only available in
27 [content_shell](./layout_tests_in_content_shell.md).
28
29*** promo
30If you know that Blink layout tests are upstreamed to other projects, such as
31[test262](https://github.com/tc39/test262), please update this document. Most
32importantly, our guidelines should to make it easy for our tests to be
pwnall6acacd82016-12-02 01:40:1533upstreamed. The
34[blink-dev mailing list](https://groups.google.com/a/chromium.org/forum/#!forum/blink-dev)
35will be happy to help you harmonize our current guidelines with communal test
36repositories.
pwnall4ea2eb32016-11-29 02:47:2537***
38
39### Test Types
40
41There are four broad types of layout tests, listed in the order of preference.
42
43* *JavaScript Tests* are the layout test implementation of
44 [xUnit tests](https://en.wikipedia.org/wiki/XUnit). These tests contain
45 assertions written in JavaScript, and pass if the assertions evaluate to
46 true.
47* *Reference Tests* render a test page and a reference page, and pass if the two
48 renderings are identical, according to a pixel-by-pixel comparison. These
49 tests are less robust, harder to debug, and significantly slower than
50 JavaScript tests, and are only used when JavaScript tests are insufficient,
51 such as when testing paint code.
52* *Pixel Tests* render a test page and compare the result against a pre-rendered
dpranked2b7d642017-01-15 04:00:2453 baseline image in the repository. Pixel tests are less robust than the
54 first two types, because the rendering of a page is influenced by
pwnall4ea2eb32016-11-29 02:47:2555 many factors such as the host computer's graphics card and driver, the
56 platform's text rendering system, and various user-configurable operating
57 system settings. For this reason, it is common for a pixel test to have a
dpranked2b7d642017-01-15 04:00:2458 different reference image for each platform that Blink is tested on, and
59 the reference images are
60 [quite cumbersome to manage](./layout_test_expectations.md). You
61 should only write a pixel test if you cannot use a reference test. By default
62 a pixel test will also dump the layout tree as text output, so they are
63 similar to ...
64* *Layout tree tests*, which output a textual representation of the layout
pwnall4ea2eb32016-11-29 02:47:2565 tree, which is the key data structure in Blink's page rendering system. The
dpranked2b7d642017-01-15 04:00:2466 test passes if the output matches a baseline text file in the repository.
67 Layout tree tests are used as a last resort to test the internal quirks of
68 the implementation, and they should be avoided in favor of one of the earlier
69 options.
pwnall59aadcb2017-01-26 23:27:2170
pwnall4ea2eb32016-11-29 02:47:2571## General Principles
72
pwnall6acacd82016-12-02 01:40:1573
74Tests should be written under the assumption that they will be upstreamed
pwnall59aadcb2017-01-26 23:27:2175to the WPT project. To this end, tests should follow the
foolipeda32ab2017-02-16 19:21:5876[WPT guidelines](http://web-platform-tests.org/writing-tests/).
pwnall6acacd82016-12-02 01:40:1577
pwnall6acacd82016-12-02 01:40:1578
pwnall59aadcb2017-01-26 23:27:2179There is no style guide that applies to all layout tests. However, some projects
80have adopted style guides, such as the
81[ServiceWorker Tests Style guide](https://www.chromium.org/blink/serviceworker/testing).
pwnall6acacd82016-12-02 01:40:1582
pwnall59aadcb2017-01-26 23:27:2183Our [document on layout tests tips](./layout_tests_tips.md) summarizes the most
84important WPT guidelines and highlights some JavaScript concepts that are worth
85paying attention to when trying to infer style rules from existing tests. If
86you're unopinionated and looking for a style guide to follow, the document also
87suggests some defaults.
pwnall6acacd82016-12-02 01:40:1588
pwnall4ea2eb32016-11-29 02:47:2589## JavaScript Tests
90
91Whenever possible, the testing criteria should be expressed in JavaScript. The
92alternatives, which will be described in future sections, result in slower and
93less reliable tests.
94
95All new JavaScript tests should be written using the
mekb330e312017-05-03 19:56:2296[testharness.js](https://github.com/w3c/web-platform-tests/tree/master/resources)
97testing framework. This framework is used by the tests in the
pwnall4ea2eb32016-11-29 02:47:2598[web-platform-tests](https://github.com/w3c/web-platform-tests) repository,
99which is shared with all the other browser vendors, so `testharness.js` tests
100are more accessible to browser developers.
101
mekb330e312017-05-03 19:56:22102See the [API documentation](http://web-platform-tests.org/writing-tests/testharness-api.html)
foolipeda32ab2017-02-16 19:21:58103for a thorough introduction to `testharness.js`.
104
105Layout tests should follow the recommendations of the above documentation.
pwnall4ea2eb32016-11-29 02:47:25106Furthermore, layout tests should include relevant
foolipeda32ab2017-02-16 19:21:58107[metadata](http://web-platform-tests.org/writing-tests/css-metadata.html). The
pwnall4ea2eb32016-11-29 02:47:25108specification URL (in `<link rel="help">`) is almost always relevant, and is
109incredibly helpful to a developer who needs to understand the test quickly.
110
111Below is a skeleton for a JavaScript test embedded in an HTML page. Note that,
112in order to follow the minimality guideline, the test omits the tags `<html>`,
113`<head>`, and `<body>`, as they can be inferred by the HTML parser.
114
115```html
116<!doctype html>
pwnall59aadcb2017-01-26 23:27:21117<title>JavaScript: the true literal is immutable and equal to itself</title>
pwnall4ea2eb32016-11-29 02:47:25118<link rel="help" href="https://tc39.github.io/ecma262/#sec-boolean-literals">
pwnall4ea2eb32016-11-29 02:47:25119<script src="/resources/testharness.js"></script>
120<script src="/resources/testharnessreport.js"></script>
121<script>
122'use strict';
123
124// Synchronous test example.
125test(() => {
126 const value = true;
127 assert_true(value, 'true literal');
128 assert_equals(value.toString(), 'true', 'the string representation of true');
129}, 'The literal true in a synchronous test case');
130
131// Asynchronous test example.
132async_test(t => {
133 const originallyTrue = true;
134 setTimeout(t.step_func_done(() => {
135 assert_equals(originallyTrue, true);
136 }), 0);
137}, 'The literal true in a setTimeout callback');
138
139// Promise test example.
140promise_test(() => {
141 return new Promise((resolve, reject) => {
142 resolve(true);
143 }).then(value => {
144 assert_true(value);
145 });
146}, 'The literal true used to resolve a Promise');
147
148</script>
149```
150
151Some points that are not immediately obvious from the example:
152
pwnall4ea2eb32016-11-29 02:47:25153* When calling an `assert_` function that compares two values, the first
154 argument is the actual value (produced by the functionality being tested), and
155 the second argument is the expected value (known good, golden). The order
156 is important, because the testing harness relies on it to generate expressive
157 error messages that are relied upon when debugging test failures.
158* The assertion description (the string argument to `assert_` methods) conveys
159 the way the actual value was obtained.
160 * If the expected value doesn't make it clear, the assertion description
161 should explain the desired behavior.
162 * Test cases with a single assertion should omit the assertion's description
163 when it is sufficiently clear.
164* Each test case describes the circumstance that it tests, without being
165 redundant.
166 * Do not start test case descriptions with redundant terms like "Testing"
167 or "Test for".
ktyliue0bb9882017-01-10 01:47:50168 * Test files with a single test case should omit the test case description.
169 The file's `<title>` should be sufficient to describe the scenario being
170 tested.
pwnall4ea2eb32016-11-29 02:47:25171* Asynchronous tests have a few subtleties.
172 * The `async_test` wrapper calls its function with a test case argument that
173 is used to signal when the test case is done, and to connect assertion
174 failures to the correct test.
175 * `t.done()` must be called after all the test case's assertions have
176 executed.
177 * Test case assertions (actually, any callback code that can throw
178 exceptions) must be wrapped in `t.step_func()` calls, so that
179 assertion failures and exceptions can be traced back to the correct test
180 case.
181 * `t.step_func_done()` is a shortcut that combines `t.step_func()` with a
182 `t.done()` call.
183
184*** promo
185Layout tests that load from `file://` origins must currently use relative paths
186to point to
187[/resources/testharness.js](../../third_party/WebKit/LayoutTests/resources/testharness.js)
188and
189[/resources/testharnessreport.js](../../third_party/WebKit/LayoutTests/resources/testharnessreport.js).
190This is contrary to the WPT guidelines, which call for absolute paths.
191This limitation does not apply to the tests in `LayoutTests/http`, which rely on
foolip339204d2017-01-27 21:10:17192an HTTP server, or to the tests in `LayoutTests/external/wpt`, which are
pwnall4ea2eb32016-11-29 02:47:25193imported from the [WPT repository](https://github.com/w3c/web-platform-tests).
194***
195
196### WPT Supplemental Testing APIs
197
198Some tests simply cannot be expressed using the Web Platform APIs. For example,
199some tests that require a user to perform a gesture, such as a mouse click,
200cannot be implemented using Web APIs. The WPT project covers some of these cases
201via supplemental testing APIs.
202
pwnall59aadcb2017-01-26 23:27:21203When writing tests that rely on supplemental testing APIs, please consider the
204cost and benefits of having the tests
205[gracefully degrade to manual tests](./layout_tests_with_manual_fallback.md) in
206the absence of the testing APIs.
207
pwnall4ea2eb32016-11-29 02:47:25208*** promo
209In many cases, the user gesture is not actually necessary. For example, many
210event handling tests can use
211[synthetic events](https://developer.mozilla.org/docs/Web/Guide/Events/Creating_and_triggering_events).
212***
213
pwnall4ea2eb32016-11-29 02:47:25214### Relying on Blink-Specific Testing APIs
215
216Tests that cannot be expressed using the Web Platform APIs or WPT's testing APIs
217use Blink-specific testing APIs. These APIs are only available in
218[content_shell](./layout_tests_in_content_shell.md), and should only be used as
219a last resort.
220
221A downside of Blink-specific APIs is that they are not as well documented as the
222Web Platform features. Learning to use a Blink-specific feature requires finding
223other tests that use it, or reading its source code.
224
225For example, the most popular Blink-specific API is `testRunner`, which is
226implemented in
Euisang Lime2f2e782018-05-09 05:04:06227[content/shell/test_runner/test_runner.h](../../content/shell/test_runner/test_runner.h)
pwnall4ea2eb32016-11-29 02:47:25228and
Euisang Lime2f2e782018-05-09 05:04:06229[content/shell/test_runner/test_runner.cc](../../content/shell/test_runner/test_runner.cc).
pwnall4ea2eb32016-11-29 02:47:25230By skimming the `TestRunnerBindings::Install` method, we learn that the
Xianzhu Wangaf4fa412018-05-14 21:26:52231testRunner API is presented by the `.testRunner` etc. objects. Reading the
pwnall4ea2eb32016-11-29 02:47:25232`TestRunnerBindings::GetObjectTemplateBuilder` method tells us what properties
Xianzhu Wangaf4fa412018-05-14 21:26:52233are available on the `testRunner` object.
pwnall4ea2eb32016-11-29 02:47:25234
Xianzhu Wangaf4fa412018-05-14 21:26:52235Another popular Blink-specific API 'internals' defined in
236[third_party/blink/renderer/core/testing/internals.idl](../../third_party/blink/renderer/core/testing/internals.idl)
237contains more direct access to blink internals.
238
239*** note
240If possible, a test using blink-specific testing APIs should be written not to
241depend on the APIs, so that it can also work directly in a browser. If the test
242does need the APIs to work, it should still check if the API is available before
243using the API. Note that though we omit the `window.` prefix when using the
244APIs, we should use the qualified name in the `if` statement:
245```javascript
246 if (window.testRunner)
247 testRunner.waitUntilDone();
248```
pwnall4ea2eb32016-11-29 02:47:25249***
250
251*** note
252`testRunner` is the most popular testing API because it is also used indirectly
253by tests that stick to Web Platform APIs. The `testharnessreport.js` file in
254`testharness.js` is specifically designated to hold glue code that connects
255`testharness.js` to the testing environment. Our implementation is in
256[third_party/WebKit/LayoutTests/resources/testharnessreport.js](../../third_party/WebKit/LayoutTests/resources/testharnessreport.js),
257and uses the `testRunner` API.
258***
259
Euisang Lime2f2e782018-05-09 05:04:06260See the [content/shell/test_runner/](../../content/shell/test_runner/) directory and
pwnall4ea2eb32016-11-29 02:47:25261[WebKit's LayoutTests guide](https://trac.webkit.org/wiki/Writing%20Layout%20Tests%20for%20DumpRenderTree)
Xianzhu Wangaf4fa412018-05-14 21:26:52262for other useful APIs. For example, `eventSender`
Euisang Lime2f2e782018-05-09 05:04:06263([content/shell/test_runner/event_sender.h](../../content/shell/test_runner/event_sender.h)
pwnall4ea2eb32016-11-29 02:47:25264and
Euisang Lime2f2e782018-05-09 05:04:06265[content/shell/test_runner/event_sender.cc](../../content/shell/test_runner/event_sender.cc))
pwnall4ea2eb32016-11-29 02:47:25266has methods that simulate events input such as keyboard / mouse input and
267drag-and-drop.
268
269Here is a UML diagram of how the `testRunner` bindings fit into Chromium.
270
271[![UML of testRunner bindings configuring platform implementation](https://docs.google.com/drawings/u/1/d/1KNRNjlxK0Q3Tp8rKxuuM5mpWf4OJQZmvm9_kpwu_Wwg/export/svg?id=1KNRNjlxK0Q3Tp8rKxuuM5mpWf4OJQZmvm9_kpwu_Wwg&pageid=p)](https://docs.google.com/drawings/d/1KNRNjlxK0Q3Tp8rKxuuM5mpWf4OJQZmvm9_kpwu_Wwg/edit)
pwnall6acacd82016-12-02 01:40:15272
pwnall4ea2eb32016-11-29 02:47:25273### Text Test Baselines
274
275By default, all the test cases in a file that uses `testharness.js` are expected
276to pass. However, in some cases, we prefer to add failing test cases to the
277repository, so that we can be notified when the failure modes change (e.g., we
278want to know if a test starts crashing rather than returning incorrect output).
279In these situations, a test file will be accompanied by a baseline, which is an
280`-expected.txt` file that contains the test's expected output.
281
282The baselines are generated automatically when appropriate by
Kent Tamuraa045a7f2018-04-25 05:08:11283`run_web_tests.py`, which is described [here](./layout_tests.md), and by the
pwnall4ea2eb32016-11-29 02:47:25284[rebaselining tools](./layout_test_expectations.md).
285
286Text baselines for `testharness.js` should be avoided, as having a text baseline
287associated with a `testharness.js` indicates the presence of a bug. For this
288reason, CLs that add text baselines must include a
289[crbug.com](https://crbug.com) link for an issue tracking the removal of the
290text expectations.
291
292* When creating tests that will be upstreamed to WPT, and Blink's current
293 behavior does not match the specification that is being tested, a text
294 baseline is necessary. Remember to create an issue tracking the expectation's
295 removal, and to link the issue in the CL description.
296* Layout tests that cannot be upstreamed to WPT should use JavaScript to
297 document Blink's current behavior, rather than using JavaScript to document
298 desired behavior and a text file to document current behavior.
299
300### The js-test.js Legacy Harness
301
302*** promo
303For historical reasons, older tests are written using the `js-test` harness.
304This harness is **deprecated**, and should not be used for new tests.
305***
306
307If you need to understand old tests, the best `js-test` documentation is its
308implementation at
309[third_party/WebKit/LayoutTests/resources/js-test.js](../../third_party/WebKit/LayoutTests/resources/js-test.js).
310
311`js-test` tests lean heavily on the Blink-specific `testRunner` testing API.
312In a nutshell, the tests call `testRunner.dumpAsText()` to signal that the page
313content should be dumped and compared against a text baseline (an
314`-expected.txt` file). As a consequence, `js-test` tests are always accompanied
315by text baselines. Asynchronous tests also use `testRunner.waitUntilDone()` and
316`testRunner.notifyDone()` to tell the testing tools when they are complete.
317
318### Tests that use an HTTP Server
319
320By default, tests are loaded as if via `file:` URLs. Some web platform features
321require tests served via HTTP or HTTPS, for example absolute paths (`src=/foo`)
322or features restricted to secure protocols.
323
324HTTP tests are those under `LayoutTests/http/tests` (or virtual variants). Use a
325locally running HTTP server (Apache) to run them. Tests are served off of ports
3268000 and 8080 for HTTP, and 8443 for HTTPS. If you run the tests using
Kent Tamuraa045a7f2018-04-25 05:08:11327`run_web_tests.py`, the server will be started automatically. To run the server
pwnall4ea2eb32016-11-29 02:47:25328manually to reproduce or debug a failure:
329
330```bash
Kent Tamurae81dbff2018-04-20 17:35:34331cd src/third_party/blink/tools
332./run_blink_httpd.py
pwnall4ea2eb32016-11-29 02:47:25333```
334
335The layout tests will be served from `http://127.0.0.1:8000`. For example, to
336run the test `http/tests/serviceworker/chromium/service-worker-allowed.html`,
337navigate to
338`http://127.0.0.1:8000/serviceworker/chromium/service-worker-allowed.html`. Some
339tests will behave differently if you go to 127.0.0.1 instead of localhost, so
340use 127.0.0.1.
341
Kent Tamurae81dbff2018-04-20 17:35:34342To kill the server, hit any key on the terminal where `run_blink_httpd.py` is
Hajime Hoshia6fad022017-08-01 17:57:58343running, or just use `taskkill` or the Task Manager on Windows, and `killall` or
344Activity Monitor on MacOS.
pwnall4ea2eb32016-11-29 02:47:25345
346The test server sets up an alias to the `LayoutTests/resources` directory. In
347HTTP tests, you can access the testing framework at e.g.
pwnalle7819482016-12-17 00:58:40348`src="/resources/testharness.js"`.
pwnall4ea2eb32016-11-29 02:47:25349
350TODO: Document [wptserve](http://wptserve.readthedocs.io/) when we are in a
351position to use it to run layout tests.
352
353## Reference Tests (Reftests)
354
355Reference tests, also known as reftests, perform a pixel-by-pixel comparison
356between the rendered image of a test page and the rendered image of a reference
357page. Most reference tests pass if the two images match, but there are cases
358where it is useful to have a test pass when the two images do _not_ match.
359
360Reference tests are more difficult to debug than JavaScript tests, and tend to
361be slower as well. Therefore, they should only be used for functionality that
362cannot be covered by JavaScript tests.
363
364New reference tests should follow the
foolipeda32ab2017-02-16 19:21:58365[WPT reftests guidelines](http://web-platform-tests.org/writing-tests/reftests.html).
366The most important points are summarized below.
pwnall4ea2eb32016-11-29 02:47:25367
pwnall6acacd82016-12-02 01:40:15368* &#x1F6A7; The test page declares the reference page using a
369 `<link rel="match">` or `<link rel="mismatch">`, depending on whether the test
370 passes when the test image matches or does not match the reference image.
pwnall4ea2eb32016-11-29 02:47:25371* The reference page must not use the feature being tested. Otherwise, the test
372 is meaningless.
373* The reference page should be as simple as possible, and should not depend on
374 advanced features. Ideally, the reference page should render as intended even
375 on browsers with poor CSS support.
376* Reference tests should be self-describing.
377* Reference tests do _not_ include `testharness.js`.
378
pwnall6acacd82016-12-02 01:40:15379&#x1F6A7; Our testing infrastructure was designed for the
pwnall4ea2eb32016-11-29 02:47:25380[WebKit reftests](https://trac.webkit.org/wiki/Writing%20Reftests) that Blink
381has inherited. The consequences are summarized below.
382
383* Each reference page must be in the same directory as its associated test.
384 Given a test page named `foo` (e.g. `foo.html` or `foo.svg`),
385 * The reference page must be named `foo-expected` (e.g.,
386 `foo-expected.html`) if the test passes when the two images match.
387 * The reference page must be named `foo-expected-mismatch` (e.g.,
388 `foo-expected-mismatch.svg`) if the test passes when the two images do
389 _not_ match.