An optimized version of BERT that uses dynamic masking and larger mini-batches to "top" standard benchmarks. The Data (TOP): A dataset specifically designed for Task-Oriented Parsing
To recap: