Results 301 to 310 of about 15,938,867 (355)
Some of the next articles are maybe not open access.

SimPO: Simple Preference Optimization with a Reference-Free Reward

Neural Information Processing Systems
Direct Preference Optimization (DPO) is a widely used offline preference optimization algorithm that reparameterizes reward functions in reinforcement learning from human feedback (RLHF) to enhance simplicity and training stability.
Yu Meng, Mengzhou Xia, Danqi Chen
semanticscholar   +1 more source

ORPO: Monolithic Preference Optimization without Reference Model

Conference on Empirical Methods in Natural Language Processing
While recent preference alignment algorithms for language models have demonstrated promising results, supervised fine-tuning (SFT) remains imperative for achieving successful convergence.
Jiwoo Hong, Noah Lee, James Thorne
semanticscholar   +1 more source

Reference and Referring

2012
Original essays on reference and referring by leading scholars that combine breadth of coverage with thematic unity. These fifteen original essays address the core semantic concepts of reference and referring from both philosophical and linguistic perspectives.
openaire   +2 more sources

REFERENCES AND REFERENCE LETTERS

2014
• Have a separate section for references at the bottom of your CV, or if you have no space mention them in your cover letter. • A reference letter increases your credibility as a suitable candidate as it is written by a (presumably) objective third party who has had direct experience of working with you and who can substantiate both your technical
openaire   +1 more source

Direct Singular Reference: Intended Reference Andactual Reference

2000
Abstract In the ideal (and normal) case of successful direct reference to an individual particular (1) there exists a particular individual that the speaker means (i.e. intends to refer to), (2) the singular term (name or description) the speaker uses applies to that individual, (3) there is an individual the audience would, in the ...
openaire   +1 more source

References/References

Callaloo, 1989
Aimé Césaire   +2 more
openaire   +1 more source

Tagalog Reference Grammar

, 2023
Paul Schachter, Fe T. Otanes
semanticscholar   +1 more source

REFERENCE: REFERENCES

The Lancet, 1975
openaire   +1 more source

Home - About - Disclaimer - Privacy