How to Get Dependency Parse Output from SyntaxNet
Asked Answered
F

1

10

How do you get a dependency parse (not syntax tree) output from SyntaxNet (https://github.com/tensorflow/models/tree/master/syntaxnet) ? I see a description of dependency parsing...a description of how to train a model, but not how to get dependency parse output.

Does SyntaxNet (Specifically the Parsey McParseface model) even do dependency parsing out of the box?

Fleabitten answered 13/5, 2016 at 21:28 Comment(0)
I
14

Passing --arg_prefix brain_parser to the parser_eval.py should do the trick. But this requires the tagged output to be fed as input.

Here's an example where the first pass tags the words and the second pass resolves dependencies:

echo 'The quick brown fox ran over the lazy dog.' | bazel-bin/syntaxnet/parser_eval \
--input stdin \
--output stdout-conll \
--model syntaxnet/models/parsey_mcparseface/tagger-params \
--task_context syntaxnet/models/parsey_mcparseface/context.pbtxt \
--hidden_layer_sizes 64 \
--arg_prefix brain_tagger \
--graph_builder structured \
--slim_model \
--batch_size 1024 | bazel-bin/syntaxnet/parser_eval \
--input stdin-conll \
--output stdout-conll \
--hidden_layer_sizes 512,512 \
--arg_prefix brain_parser \
--graph_builder structured \
--task_context syntaxnet/models/parsey_mcparseface/context.pbtxt \
--model_path syntaxnet/models/parsey_mcparseface/parser-params \
--slim_model --batch_size 1024

This generates the following output:

1       The     _       DET     DT      _       4       det     _       _
2       quick   _       ADJ     JJ      _       4       amod    _       _
3       brown   _       ADJ     JJ      _       4       amod    _       _
4       fox     _       NOUN    NN      _       5       nsubj   _       _
5       ran     _       VERB    VBD     _       0       ROOT    _       _
6       over    _       ADP     IN      _       5       prep    _       _
7       the     _       DET     DT      _       9       det     _       _
8       lazy    _       ADJ     JJ      _       9       amod    _       _
9       dog     _       NOUN    NN      _       6       pobj    _       _
10      .       _       .       .       _       5       punct   _       _
Isoagglutinin answered 15/5, 2016 at 10:47 Comment(7)
To add on to this, if you simply comment-out the bazel-bin/syntaxnet/conll2tree final pipe (and args) of the models/syntaxnet/syntaxnet/demo.sh you will get this result automatically.Fleabitten
I'm having trouble finding an in depth explanation of this output. What do the rows of dashes stand for? Also what's the purpose of the row of numbers( Not 1 - 10. The other one )Polka
@Kahless: See ilk.uvt.nl/conll/#dataformat for field definitions.Isoagglutinin
@Isoagglutinin is there some kind of man page or other documentation describing the possible values for arg_prefix?Kohlrabi
@matanster: The possible values are either brain_pos or brain_parser. Details can be found here: github.com/tensorflow/models/tree/master/syntaxnetIsoagglutinin
Is that possible with russian language too? According to pretrained models github.com/tensorflow/models/blob/master/syntaxnet/universal.mdVinavinaceous
@VicNicethemer: Yes, just download the model from download.tensorflow.org/models/parsey_universal/Russian.zip and refer to the appropriate files within the folder. You might not be able to feed the context though, as the context file seems to be not available for the Parsey's Cousins languages.Isoagglutinin

© 2022 - 2024 — McMap. All rights reserved.