Types in Python
Python's type system was specified in PEP 484. If you are new to Python's type system and want to learn the basics, we highly recommend you take a look at mypy's cheatsheet as well as their type system reference. The following discussion focuses on Pyre's approach to "gradual typing" and how you can get from an untyped codebase to a fully typed codebase.
Why Types?
Gradual Typing
Most Python code does not (yet) start out typed. PEP 484 specifies a gradual type system, which is built to allow you to gradually add annotations over time. It does so by
- only reporting errors on functions that have an explicit return or parameter type annotation,
- introducing an escape hatch: a special type
Any
that has all possible attributes and is both sub- and super-type of any other type, - and assuming that all untyped fuctions implicitly return
Any
.
For example,
from typing import List
def unannotated(): # implictly returns `Any`
return b"" + "" # function body is not checked
def annotated() -> List: # explicit return annotation means we type check `annotated`
any = unannotated()
any.attribute # `Any` has all possible attributes
return 1 # Error: returning `int` but expecting `List`
In combination, these rules allow you to slowly annotate code without getting overwhelmed by type errors in one sitting. Incrementally adding more annotations will give you stronger safety and consistency guarantees in your codebase.
In the example above, if you changed unannotated
to return str
, you would get a type error when accessing the attribute any.attribute
in annotated
.
Strict Mode
While Any
is a necessary escape hatch when annotating large codebases over time, it can hide legitimate type errors. We've introduced strict mode in Pyre to address this problem. Strict mode can be toggled at a module level by introducing a # pyre-strict
comment to the file. In strict mode, Pyre will
- run on all functions, whether they are annotated or not,
- error on functions, globals, or attributes that are missing annotations,
- and error on annotations containing
Any
(with some exceptions to accommodate for common patterns).
In our previous example,
# pyre-strict
from typing import List
def unannotated(): # Error: missing return annotation
return b"" + "" # Error: function body *is* checked
def annotated() -> List: # Error: implicit `Any` for generic parameter to `List`
any = unannotated()
any.attribute # Note: the type of `any` is still any.
return 1 # Error: returning `int` but expecting `List`
As you can see in the example, Any
can still sneak into modules that are strict, but increasing strict coverage and fixing the surfaced errors will gradually eliminate them.
Strict-By-Default
Strict mode can also be set as the default in a project configuration. To opt individual files out of strict mode, use # pyre-unsafe
in place of # pyre-strict
.
How to move away from Any
What if I want to use Any
?
It may be tempting to annotate a generic function parameter with Any
. But while it is a convenient utility for quickly annotating untyped code, it has no place in a strict-mode codebase. The main problem with Any
is that it unifies with every type - which effectively hides all potential type errors that could stem from incorrect usage of anything annotated with Any
. It is not, therefore, a good idea to use it in generic code.
What then?
We have two main methods for annotating generic code: object
and TypeVar
. As it turns out, it is not at all obvious which of them should be used where. However, the general TL;DR is that object
is an opaque superclass of all types, while TypeVar
is for preserving a type across one or more function calls.
About object
The advantage of using object
over Any
is that while any type can be “put inside” it, it is an error to use it as any type other than object
. This can be useful everywhere we need type erasure, like (de)serialization or generic heterogeneous containers where there is no obvious common supertype. Thanks to Python’s runtime reflection, the original type can be recovered, e.g. through isinstance
. Note that such checks are valid only for a short time, see here.
About TypeVar
TypeVar
s are somewhat interesting beasts, as in a vacuum, they can behave both like Any
and like object
. Like object
, they accept all types, but within a single typecheck (i.e. one line/function call/operation) they remember what type they were. This makes them useful for e.g. linking the parameters and return types of a function, or class attribute types with its method signatures. There are, however, a couple of caveats. It doesn’t make sense to use a TypeVar on a function or method when it’s only used in its parameters (just use the most general known supertype instead).
Another story is the difference between TypeVar
s’ invariance, variance and contravariance, which is covered in here.
When Source Code is not Available
We do not always have access to all the source code that contributes type information to our project: e.g. builtins
is compiled native code, and other libraries may be using Cython. Other times, we may be working with Python code that is just too dynamic to be reasonably typed.
To address these cases, Pyre will give precedence to type stub files with a *.pyi
extension over source files when these are specified in the search path in the project configuration or if they are located next to the implementation file.
Stub files have the same structure as implementation files but only contain class and function signatures:
# my_dynamic_module.pyi
def dynamic_function() -> int: ... # Function body is omitted
If a __getattr__
function is defined in the stub file as follows, Pyre will take it as a signal that the stub file is partially complete: accessing attributes whose name is not defined in the stub file will result in Any
instead of a type error.
# my_stub.pyi
from typing import Any
foo: int = 42
# Parameter needs to be typed as `str` and return type needs to be `Any`
def __getattr__(name: str) -> Any: ...
# my_source.py
import my_stub
reveal_type(my_stub.foo) # Reveals `int`
reveal_type(my_stub.undefined) # Reveals `Any`
Typeshed
Strategies for Increasing Coverage
Pyre comes with tooling to make it easy to increase type coverage in your project.
Upgrade
When upgrading the type checker, new errors inevitably get surfaced. In order to keep a codebase clean through upgrades we've built pyre-upgrade
, which automatically suppresses newly surfaced type errors. It takes Pyre's output and adds supression comments to the code explaining what's wrong so that developers can easily address the issues individually.
You can run pyre-upgrade
with
(venv) $ pyre --output=json | pyre-upgrade fixme
or if you are using a local configuration
(venv) $ pyre --output=json -l <project> | pyre-upgrade fixme
Automatic Type Inference
We have found tools that automatically add type annotations to code useful to get started with a project. There are two general approaches to automatic type inference: static inference and dynamic inference from runtime information. Both approaches come with their own trade-offs and we have found a combination of the two to be useful.
Pyre can do static type inference. You can run
(venv) $ cd <path to project>; pyre infer -i
to automatically apply annotations.
For dynamic inference we recommend you give MonkeyType a try.