Abstract:Arbitrary symbolism is a linguistic doctrine that predicts an orthogonal relationship between word forms and their corresponding meanings. Recent corpora analyses have demonstrated violations of arbitrary symbolism with respect to concreteness, a variable characterizing the sensorimotor salience of a word. In addition to qualitative semantic differences, abstract and concrete words are also marked by distinct morphophonological structures such as length and morphological complexity. Native English speakers show sensitivity to these markers in tasks such as auditory word recognition and naming. One unanswered question is whether this violation of arbitrariness reflects an idiosyncratic property of the English lexicon or whether word concreteness is a marked phenomenon across other natural languages. We isolated concrete and abstract English nouns (N = 400), and translated each into Russian, Arabic, Dutch, Mandarin, Hindi, Korean, Hebrew, and American Sign Language. We conducted offline acoustic analyses of abstract and concrete word length discrepancies across languages. In a separate experiment, native English speakers (N = 56) with no prior knowledge of these foreign languages judged concreteness of these nouns (e.g., Can you see, hear, feel, or touch this? Yes/No). Each naïve participant heard pre-recorded words presented in randomized blocks of three foreign languages following a brief listening exposure to a narrative sample from each respective language. Concrete and abstract words differed by length across five of eight languages, and prediction accuracy exceeded chance for four of eight languages. These results suggest that word concreteness is a marked phenomenon across several of the world's most widely spoken languages. We interpret these findings as supportive of an adaptive cognitive heuristic that allows listeners to exploit non-arbitrary mappings of word form to word meaning.

Automatic generation of a large dictionary with concreteness/abstractness ratings based on a small human dictionary

Automatically Creating a Large Number of New Bilingual Dictionaries

Around the world in 60 words: A generative vocabulary test for online research

Low-Cost Generation and Evaluation of Dictionary Example Sentences

Investigating the Nature of Disagreements on Mid-Scale Ratings: A Case Study on the Abstractness-Concreteness Continuum

Low Rank Multi-Dictionary Selection at Scale

Automatic Construction of Clean Broad-Coverage Translation Lexicons

Concreteness ratings for 40 thousand generally known English word lemmas

Efficient Dictionary Learning with Sparseness-Enforcing Projections

Presence or Absence: Are Unknown Word Usages in Dictionaries?

Sddb: A Self-Dependent And Data-Based Method For Constructing Bilingual Dictionary From The Web

Non-Arbitrariness in Mapping Word Form to Meaning: Cross-Linguistic Formal Markers of Word Concreteness

Where Divergent Ideas Converge: Answers to AUT Found on Short List of Word Co-Occurrences Terms

Using large language models to estimate features of multi-word expressions: Concreteness, valence, arousal

Automatic Construction of Sememe Knowledge Bases via Dictionaries.

A prompt construction method for the reverse dictionary task of large-scale language models

Vocabulary Size Influences Spontaneous Speech in Native Language Users: Validating the Use of Automatic Speech Recognition in Individual Differences Research

Revisiting the concreteness effect: Non-arbitrary mappings between form and concreteness of English words influence lexical processing

Deep Lexical Hypothesis: Identifying personality structure in natural language

Can large language models help augment English psycholinguistic datasets?

Automated Scoring of Scientific Creativity in German