Representation learning or Vector Embedding #101

Open
siddhartha-gadgil opened this issue Sep 13, 2016 · 10 comments
Labels
notes Using the issue to keep as notes

Comments

@siddhartha-gadgil
Owner

siddhartha-gadgil commented Sep 13, 2016

There are many ways in which, given a (weighted) collection of linear terms, we can generate weighted graphs. Namely, we traverse based on:

  • combinations of terms: function application, lambdas, etc.
  • terms joined to their types.
  • alternatively, terms mapped to types to give relations between types only.
  • ordering, sectioning, etc. of sources.

Each of these gives representations of terms and types as vectors. We can use these for various decisions and maps, e.g.

  • deciding whether a statement is a priori likely to be true, so we can recognize strong statements.
  • mapping statements to likely ingredients of their proofs.
  • basic evaluation of a statement, better than just complexity.
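
As a purely illustrative reading of the above, here is a minimal Scala sketch of building such a weighted graph from a weighted collection of terms, using just two of the traversals listed: function application and terms joined to their types. The types and names are assumptions made for this sketch, not the actual code in this repository.

```scala
// Hypothetical minimal term type: a name together with (the name of) its type.
case class Term(name: String, tpe: String)

case class WeightedEdge(src: String, dst: String, weight: Double)

def termGraph(terms: Map[Term, Double],
              applications: Seq[(Term, Term, Term)] // (function, argument, result)
             ): Vector[WeightedEdge] = {
  // edges joining each term to its type, weighted by the term's weight
  val typeEdges = terms.toVector.map { case (t, w) =>
    WeightedEdge(t.name, t.tpe, w)
  }
  // edges from function application: both f and x are linked to f(x)
  val appEdges = applications.flatMap { case (f, x, fx) =>
    val w = terms.getOrElse(fx, 0.0)
    Vector(WeightedEdge(f.name, fx.name, w), WeightedEdge(x.name, fx.name, w))
  }
  typeEdges ++ appEdges
}
```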
@siddhartha-gadgil
Owner Author

Bird2Vec lessons:

  • no magic in word2vec;
  • should first of all use type information;
  • should learn vectors from actual predictions first, not synthetic ones.

@siddhartha-gadgil
Owner Author

This is not really an issue, but part of better learning and dynamics.

@siddhartha-gadgil
Owner Author

  • This is now ready for implementation given the generation of equation terms.
  • We don't need to use predictions but can use actual nodes that are constructed.
  • Key cost: discriminate between existing and non-existing nodes that are equation terms for the given node.
  • Islands: mapping into islands can be modelled by a low-rank linear map plus a scalar multiple of the identity (see the sketch after this list).
  • Given a new term we can simply learn afresh starting with the state we had already learnt.
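
A minimal sketch of the island map mentioned above, i.e. a low-rank linear map plus a scalar multiple of the identity, x ↦ U(Vᵀx) + λx. The names and the plain-array representation are assumptions for illustration, not the RepresentationLearner API.

```scala
// Low-rank-plus-scalar-identity map on d-dimensional vectors:
//   apply(x) = U * (V^T x) + lambda * x, with U, V both d x r matrices.
case class IslandMap(u: Array[Array[Double]],  // d x r
                     v: Array[Array[Double]],  // d x r
                     lambda: Double) {
  def apply(x: Array[Double]): Array[Double] = {
    val r = u.head.length
    // V^T x, an r-dimensional vector
    val vtx = Array.tabulate(r)(j => v.indices.map(i => v(i)(j) * x(i)).sum)
    // U * (V^T x) + lambda * x
    Array.tabulate(x.length)(i =>
      (0 until r).map(j => u(i)(j) * vtx(j)).sum + lambda * x(i))
  }
}
```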

@siddhartha-gadgil
Owner Author

  • Simplify: just use some form of force-directed graph.
  • This is not representation learning, just a simple form of vector embedding.

@siddhartha-gadgil siddhartha-gadgil changed the title Representation learning Representation learning or Vector Embedding May 24, 2019
@siddhartha-gadgil
Owner Author

Can use bi-directional flows as in #256

@siddhartha-gadgil siddhartha-gadgil added the notes Using the issue to keep as notes label Jul 16, 2019
@siddhartha-gadgil
Owner Author

This was implemented a while ago as RepresentationLearner, but

  • we must test.
  • we may want bi-directional flows for robustness.

@siddhartha-gadgil
Owner Author

code2vec may be directly applicable here.

@siddhartha-gadgil
Owner Author

  • The transformer approach may be much better.
  • It lets us avoid unnatural linearizations; for example, we can use positional embeddings based on representations in place of trigonometric functions (a sketch follows below).
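
A hedged sketch of that last point: alongside the standard trigonometric (sinusoidal) encoding, the "position" vector for a node could instead be looked up from an existing representation of that node (e.g. a force-directed embedding as discussed below). The lookup table and function names are hypothetical.

```scala
// Standard sinusoidal positional encoding, for comparison.
def sinusoidal(pos: Int, dim: Int): Array[Double] =
  Array.tabulate(dim) { i =>
    val angle = pos / math.pow(10000.0, (2 * (i / 2)).toDouble / dim)
    if (i % 2 == 0) math.sin(angle) else math.cos(angle)
  }

// Representation-based alternative: use the node's learned vector directly
// as its positional embedding (zero vector if the node has no representation).
def positionalFromRepresentation(node: String,
                                 reps: Map[String, Array[Double]],
                                 dim: Int): Array[Double] =
  reps.getOrElse(node, Array.fill(dim)(0.0))
```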

@siddhartha-gadgil
Owner Author

Representations: force directed

  • We represent elements using a force-directed model, based either on explicit forces or predictions.
  • If we use predictions, then positive and negative samples correspond to attractive and repulsive forces.
  • We can have a single repulsive force based on distance of points in the representation, stronger when they are close.
  • We have a norm, essentially a descriptive complexity, on elements.
  • Elements with small descriptive complexity are attracted to the origin.
  • We get a similar attraction between the lhs and rhs of relations when the other terms have small complexity.
  • A third attractive force depends on the representations: essentially if the representations of f and g are close and so are those of x and y, then there is an attraction between f(x) and g(y).
  • We have a variant of the above where we consider projections and have attractions when these are small, but the attraction should itself be a projected one.
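
A minimal sketch of one update step of such a force-directed model, combining three of the forces above: attraction to the origin weighted by descriptive complexity, pairwise repulsion that is stronger when points are close, and attraction between the two sides of each relation. The specific force laws, names and learning rate are assumptions for illustration, not the actual model.

```scala
def step(reps: Map[String, Array[Double]],
         complexity: Map[String, Double],   // descriptive complexity (norm) of each element
         relations: Seq[(String, String)],  // (lhs, rhs) pairs
         lr: Double = 0.01): Map[String, Array[Double]] = {
  def sub(a: Array[Double], b: Array[Double]) = a.zip(b).map { case (p, q) => p - q }
  def add(a: Array[Double], b: Array[Double]) = a.zip(b).map { case (p, q) => p + q }
  def scale(a: Array[Double], c: Double)      = a.map(_ * c)
  def normSq(a: Array[Double])                = a.map(p => p * p).sum

  reps.map { case (name, x) =>
    // attraction to the origin, stronger for elements with small complexity
    val toOrigin = scale(x, -1.0 / (1.0 + complexity.getOrElse(name, 1.0)))

    // repulsion from every other point, stronger when the points are close
    val repulsion = reps.collect {
      case (other, y) if other != name =>
        scale(sub(x, y), 1.0 / (1e-6 + normSq(sub(x, y))))
    }.foldLeft(Array.fill(x.length)(0.0))(add)

    // attraction between the two sides of each relation containing this element
    val relational = relations.collect {
      case (l, r) if l == name => sub(reps(r), x)
      case (l, r) if r == name => sub(reps(l), x)
    }.foldLeft(Array.fill(x.length)(0.0))(add)

    name -> add(x, scale(add(add(toOrigin, repulsion), relational), lr))
  }
}
```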

@siddhartha-gadgil
Owner Author

Representations: Use predictions and dot products

  • A model where we learnt two representations and predicted with dot products was not much worse than force-directed.
  • On the other hand, this needs far fewer choices.
  • Hence it is probably best to go with this.
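
A sketch of this dot-product model under those assumptions: two representations per element (as "source" and "target"), a score given by their dot product, and a logistic-loss update that pushes observed (positive) pairs up and negative samples down. The sampling scheme and training loop are not specified here.

```scala
def dot(a: Array[Double], b: Array[Double]): Double =
  a.zip(b).map { case (p, q) => p * q }.sum

def sigmoid(z: Double): Double = 1.0 / (1.0 + math.exp(-z))

// One SGD update for a (source, target, label) example, label = 1.0 (positive)
// or 0.0 (negative sample); the arrays are updated in place.
def update(src: Array[Double], tgt: Array[Double], label: Double, lr: Double): Unit = {
  val grad = sigmoid(dot(src, tgt)) - label  // gradient of logistic loss w.r.t. the score
  for (i <- src.indices) {
    val s = src(i)
    src(i) -= lr * grad * tgt(i)
    tgt(i) -= lr * grad * s
  }
}
```

Compared with the force-directed sketch, this needs only the choice of dimension, learning rate and negative-sampling scheme, which matches the point above about far fewer choices.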
