[Tp-legal] Position Paper "Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs"

Paweł Kamocki pawel.kamocki at gmail.com
Do Feb 27 14:08:08 CET 2025


Dear Philippe, dear all,

@Philippe: Thank you very much for sharing this, most inspiring!
@all: would you agree that these Knowledge Units (definition below) are in
fact elaborate DTFs?:

Knowledge Unit (KU): A set of entities, at-

tributes, and relationships, capturing a short origi-

nal text excerpt.

Each Knowledge Unit captures:

• Entities: the core concepts or objects in the paragraph,

with relevant attributes.

• Relationships: statements that connect or link entities,

such as causal or definitional relationships.

• Attributes: statements that describe entities according to

the excerpt.

• Context summary: A few sentences summarizing the pre-

vious knowledge units.

• Sentence MinHash: A list of MinHashes of the source

sentences used to generate this KU.


Kind regards,

Paweł

On Thu, 27 Feb 2025 at 09:41, Genêt, Philippe <P.Genet at dnb.de> wrote:

> Dear all,
>
>
>
> Today, a position paper (with participation of TIB, LAION, L3S, Uni TÜ
> etc.) has been published that envisages extracting “Knowledge Units” from
> in-copyright scholarly texts that can be used by LLMs in a legally sound
> way. I think it may be of some interest to you. J
>
>
>
> The paper can be downloaded here: https://arxiv.org/pdf/2502.19413
>
>
>
> Cheers
>
> Philippe
>
>
>
> --
> Philippe Genêt
>
> Koordinator DNB at Text+
>
>
> Deutsche Nationalbibliothek
> Fachbereich Informationsinfrastruktur
> Adickesallee 1
> 60322 Frankfurt am Main
>
> Telefon: +49 69 1525-1847
>
> E-Mail: p.genet at dnb.de
>
>
>
> text-plus.org <http://www.text-plus.org/>
>
> dnb.de <http://www.dnb.de/>
>
>
> --
> Tp-legal mailing list
> Tp-legal at lists.dnb.de
> https://lists.dnb.de/mailman/listinfo/tp-legal
>
-------------- nächster Teil --------------
Ein Dateianhang mit HTML-Daten wurde abgetrennt...
URL: <http://lists.dnb.de/pipermail/tp-legal/attachments/20250227/1f790c2f/attachment.htm>


Mehr Informationen über die Mailingliste Tp-legal