Q: What's the Relationship Between Transformer Network Size and Task Performance?

1 point by syntex 3 years ago · 0 comments · 1 min read

I'm exploring the relationship between the number of parameters in Transformer networks and the range of tasks they can perform. Basically, I would like to be able to say that networks of a given order of magnitude in size are capable of "X" types of tasks. I also wonder what very small networks are capable of. Do you know of any good sources, papers, or discussions about this?

No comments yet.
