Obtaining transformation matrix (for other languages)

Hi,

First of all, I would like to thank you for this great package and inspiring paper.

I'm just following your Quick Start Guide (https://github.com/prodriguezsosa/conText/blob/master/vignettes/quickstart.md) to get familiar with the procedure of applying embedding regression.

I have mainly 2 questions right now. 

1) Do you provide a formula to learn the transformation matrix? In the tutorial you use `cr_transform`, while you mention that this is based on an estimation by Khodak et al. (2018) (khodakA.rds). However, is the formula below (from your 2023 paper) somehow implemented in `conText`?

<img width="343" alt="image" src="https://github.com/prodriguezsosa/conText/assets/47723699/f15da0a9-8e71-4f24-865f-8b5b05f7792c">

2) This gets even more important if one tries to move to another language. If I would analyze the semantics of English texts, this would probably work with the provided data from the dropbox, however, I would like to analyze German text with respect to political ideology.

While German word embeddings are available: https://www.deepset.ai/german-word-embeddings, I haven't found corresponding transformation matrices for ALC so far. So I thought I could maybe rely on a function within your package or have to train the learn the transformation matrix on my own with the above formula? Are there any further obstacles when moving to another language beyond the pre-trained embeddings and the (yet to be estimated) transformation matrix?

Thanks in advance!

Best,
Lukas

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Obtaining transformation matrix (for other languages) #27

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Obtaining transformation matrix (for other languages) #27

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions