In Russian
Contents Türkic languages

Contents Türkic in English

Classification of Türkic languages
G. Ekholm Germananic Ethnology
C. Stevens Gmn.-Türkic traits
A. Toth German Lexicon
A. Toth Turkic and English
R. Mc Callister Non-IE in Gmc. languages

Türkic borrowings in English
Türkic in Romance
Alans in Pyrenees
Türkic in Greek

Alan Dateline
Avar Dateline
Besenyo Dateline
Bulgar Dateline
Huns Dateline
Karluk Dateline
Kimak Dateline
Kipchak Dateline
Khazar Dateline
Kyrgyz Dateline
Sabir Dateline
Seyanto Dateline
Start Türkic in English


Whatever is the state of this posting, it is incomplete. It is only a primer. This work is presented as a point of departure, to initiate new curiosity and inspire more independent studies. A wealth of lexemes is not listed, plenty of details on each one is untapped. Many lexemes are brief succulent words of “mysterious” provenance, like boss and OK. Many are dialectal forms from only a few Türkic languages. The road is wide open, the surface has been barely scratched. The light is not at the end of the tunnel, the light is with us. Many surprises are awaiting to be unwrapped.

N. Kisamov
Türkic Substrate in English
Journal of Eurasian Studies,
October-December 2013, Volume V, Issue 4
Mikes International, The Hague, Holland, 2013, ISSN 1877-4199
© Copyright Mikes International 2001-2013, All Rights Reserved

Russian text
Òþðêñêèé Ñóáñòðàò â Àíãëèéñêîì
â ñáîðíèêå
Âîïðîñû ýòíîãåíåçà è ýòíè÷åñêîé èñòîðèè íàðîäîâ Ñðåäíåé Àçèè, Øàìñèääèí Êàìîëèääèí (ðåä.), Âûï. 1, 2016, ñòð. 191-240
LAP LAMBERT Academic Publishing, Saarbrucken, Deutschland, ISBN 978-3-659-95130-5


https://web.archive.org/web/20130914022422/ Web Archive
For Germanic substrate hypothesis refer to that Wikipedia article, which suffers a major bout of blindness indirectly addressed on this page.
For a complete Irek Bikkinin's article “Turkic Borrowings In English”, 1994, click here: http://www.tatarica.narod.ru/world/language/tat_eng.htm (in Russian), Turkic Borrowings In English (in English).
Valentyn Stetsyuk, 2003, Research of Prehistoric Ethnogenetic Processes in Eastern Europe
Comments on Indo-European linguistics are here: http://www.ece.lsu.edu/kak/ary2.pdf

English Linguistic Timeline
The graph is a simplified draft. Etymological panorama needs to show contribution of each component,
including missing ones, and their real fractional contribution to modern live language.

Since the initial publication of the article in 2013, the substrate word list has steadily grown by about doubling, along with expanded narrative part.

Türkic Substrate in English

      Sample list of cognate Türkic–English words
      Sample list of identical Türkic–English words
Table 1a. Frequency listings for Türkic–English correspondences
Swadesh list
Latin and Türkic Languages
Table 2a. Türkic–Latin–English lexical correspondences
Sanskrit and Türkic Languages
Table 2b. Türkic–Sanskrit–English lexical correspondences
Germanic and Türkic Languages
Table 3. Türkic–English suffixes
Table 3b. Türkic–English prefixes
Table 4. Türkic–English lexical correspondences
Etymological notes
1. General
      1.1 Personal and demonstrative pronouns
      Table 5. Türkic–English pronominal correspondences
2. Morphology
3. Verbs
4. Nouns
      4.1 Body
      4.2 Dress
      4.3 Social
      4.4 Religious
      4.5 Commercial
      4.6 Household
      4.7 Cooking and food
      4.8 Animals
      4.9 Life
5. Adjectives
6. Other
Chuvash-Germanic lexicon
Table 6. Chuvash–Germanic correspondences

Author’s Foreword. The objective of this book is history, a language is no more than its ocular. To the curious, the work demonstrates the tangible origin of a good portion of the European languages from historic and genetic perspective. I have a very rudimentary familiarity with any live Türkic language, and for the linguistic contents, this compendium relied on attestations found in literature and dictionaries, and not in a small degree on insights and kind help of native speakers, Türkic and non-Türkic. The fundamental facts supporting this work are altered neither by the want of linguistic competence, nor by the want of unflawed presentation. The scrutiny is useless to those who already know it all. The work consists of two parts; the first part contains an overview of the history, methods, aspects of the problem, premises, and broad observations in the form of mini essays. The second part contains a catalogue of the lexical material under a heading Etymological Notes. With a vast measure of translations, quotations, references, and comparisons, such a mass is apt to accrete mistakes, misunderstandings, and misinterpretations. The father of linguistics M. Kashgari said a millennium ago, Yazmas atım yağmur, yañılmas bilge yañku “only rain shoots without misses, only echo is a sage without mistakes”. The surface of the historical canvas has been barely scratched. There is plenty of room for kind advice and improvement. A spectrum of averted subjects are ready for examination.


Kim ol jolsuz ersä aŋa keŋrü jol
Who does not trod a beaten path has his course wide open

Turkism (or Turkizm) is a word in any language that comes from Türkic languages, directly or indirectly. The adjective Turkic (or Türkic) applies not to an individual language, but to the entire linguistic family numbering some 40+ languages with various degree of mutual understandability. Unlike the structured pairs Celtic/Celt, Turkish/Turk, British/Brit, the collective adjective Turkic (Türkic) does not have a standard counterpart noun in English, since the word “Turk” is already used as a concrete noun counterpart for “Turkish”. Very few names of the Turkic ethnoses and languages contain the part “Türk”. Most linguists know the difference between Turkish and Turkic/Türkic, and do not confuse them. In this book, for clarity, the complementary terms Türkic and Türk are used as collective designations.

The term “Proto-Türkic” refers to a continuum of languages with similar syntactic typologies that formed a lingua franca Sprachbund.  The use of the term “Proto-Türkic” is better avoided since it is inaccurately used as a formed seed language under inapplicable Family Tree linguistic model. As a result of diverse linguistic amalgamations, its functional designation is better defined with the unconventional term “Türkic Sprachbund”. Following N.Trubetzkoy (N.Trubetzkoy 1923, 1928), the Türkic linguistic family belongs to the Sprachbund language union formed between language families consisting of unrelated Türkic, Uralic, Mongolian, and Manchurian families, dubbed “Union of Uralo-Altaic language families”. A language or language family can at the same time belong, or fluctuate between, two different Sprachbunds, such as Indo-European fluctuating between the Mediterranean and Uralo-Altaic Sprachbunds. A Sprachbund is defined as an areal group of languages with common traits:
1. considerable similarity in syntax;
2. similarity in principles of morphological structure;
3. a large number of common cultural words, and optionally
4. a surface similarity in sound systems, lacking however any systematic sound correspondences, correspondences in morphological elements, and common elementary vocabulary.

In contrast to the Family Tree model, amalgamation occurs on a case-by-case basis, it does not follow any particular “linguistic law”. Amalgamation is always unique, even when it occurs in nearly the same linguistic, economic, and environmental circumstances. It is a result of stochastic human interactions, environmental events, and very stochastic human whims. It follows the rule that there are no rules, only some tendencies with uncertain statistic frequencies. In that, amalgamation agrees with the Family Tree model and in particular with the Indo-European linguistic hypothesis where they accept a premise of statistical probability for tendencies inaccurately named “linguistic laws”. The Turkish language, for example, is amalgamated of dozens of languages, each with its own history and idiosyncrasies, not all of them entirely Türkic, with a substantial influence of the local, Arabic and Persian cultural layers. Just the Islamic segment of the M. Kashgari’s work of the 11th c. lists 8 Oguz and 2 Ogur monolingual Türkic communities, and 10 bilingual Türkic communities, neither of those accounted for the Arabic and Persian cultural influence. Nearly completely outside of the M. Kashgari’s orbit were the huge western and eastern segments of the Turkic phylum, many of which over the last millennium also contributed to the Turkish language. In the last two centuries, many languages experienced profound changes under unification, division, cleansing, reforms, leveling, and increasing influx from culturally and geographically domineering languages. All IE languages passed through a similar agglomeration that affected many linguistic aspects including typology and phonetics.

Typological features of the Türkic languages include an agglutinative and exclusively suffixing word structure, sound harmony, verb-final word order, with dependents preceding their head, and use of numerous nonfinite verb constructions. The Türkic long vowels disappeared way back, they only remain in some geographically and phonetically scattered dialects (Kyzyl and tribal Türkic languages of the southwestern Turkey). The clay of the Türkic languages is extremely malleable, while the clay granules and the affinity of different granules are fairly firm. This combination of properties allows smooth reuse of the same granules in very different functions, similar to the economy of English, where for example crossing a crossing at a crossing point recycles a three-phoneme stem and a three-phoneme suffix as a noun, verb, and an adjective. Agglutinative languages, like Turkic, Finnish, Sumerian, do not use articles, since suffixes serve as discriminators between verbs and nouns, and articles need not to be borrowed, unless a language is creolized and had lost its functional suffixes. Typically, articles are borrowed from à lingua franca or amalgamating languages, like the Hungarian the is a borrowed that, at times in a form of a calque, like the Türkic bir is a calque of one. In most cases such cultural exchanges are transparent and traceable. These typological features are largely shared by the “Union of Uralo-Altaic language families”, and their elements are shared by a wide spectrum of the Eurasian and Native American languages. The commonalities between the Eurasian and Native American languages attest to the existence of the “Union of Uralo-Altaic language families” prior to the peopling of Americas approximately 15,000 years ago. The word “Union” in this context is as stochastic as a stratified by elevation mountain forest with some species randomly scattered along the slopes.

The term “substrate” in linguistics refers to an indigenous language that in the process of diffusion and convergence contributes features to the language of the later migrants, it presupposes distinct layering of the language, defines the temporal sequence, and the direction of convergence. The term “substrate” implies a distinct formed local language at the time of its encounter with a distinct formed alien language. By definition, the English being a Germanic language, the substrate of English was overlaid and supplanted by Germanic, making it a Germanic language, and then by the Old French, it was gradually enriched by Celtic, Danish, Latin, and other lexicons that also carried numerous Turkisms. The term “substrate” presupposes a linear development, a particular case of an amalgamation. Amalgamation is a layered process. Where amalgamation processes are not linear, the term “substrate” from an absolute becomes a conditional term, changing its content to suit a specific situation. That is the case with English, where the amalgamation process is far from being clear, is an object of the research, and the term “substrate” is a conditional convention to name one of the basic components.

To recognize substrate words within a mixed lexicon was proposed a complex of logical diagnostic criteria arranged in an order of relative importance:
1. candidate word with dubious or no etymology in the host family
2. candidate word with a base root within the substrate family
3. wide distribution (>50%) of the compared words in the substrate and host families
4. semantic affinity
5. phonetic affinity

Each substrate candidate should be examined along the minimal, but largely insufficient criteria, to validate a candidate and at least to reveal additional factors supporting or contravening a substrate affinity. Frequently, applicability of individual criteria is hindered by a paucity of sources, especially in the temporal aspect. The criteria of a comparatively later appearance of a word in a host language (e.g. Lat. vs. LLat.) vesus an earliest attestation help to come to a reasonable conclusion. Given that just to be rated a candidate a substrate candidate satisfies criteria 1, a bias in the prior evaluations may flag a preference for attribution of the cognates to an advocated linguistic group, with a support from a circular logics. The only objective path to a reasonable conclusion is an equitable comparison of attested base roots, without appeal to “reconstructed” unattested models.

Languages are bio-cultural hybrids, products of evolution and hybridization. Linguistic process, like all processes in universe, are cyclical. Processes emerge, run, and decay. Processes are controlled by obstructions. Processes run as interactions between the energy of the process and sturdiness of the obstruction. Processes drug chunks of obstructions forward, carried by the process flow, drop them off as some barriers built of various obstacles. They continue in a new quality, resuming anew after some equilibrium point, embarking on a new cycle under new conditions. Regular patterns arise from random starting conditions, and end with the rise of the new random conditions. In linguistics, little is known of the cycles preceding the advent of our era, save for few known cycles that happened on the European peninsula at the end of the western Eurasia. The most powerful processes were connected with the expansion of the Greek and Latin languages, which in the process of Europeanization, later called Indo-Europeanization, overcame the resistance of the local languages. The advent of the Classical languages waned with Industrial Revolution, opening an era of cultural and technological languages: French, German, and nowadays English. In the 2nd mill. BC, long before the run-up to our era, the Türkic languages of the Kurgans were on the wane, supplanted by the resurgence of the local European languages. Before that, the Türkic languages were widespread, in lieu of historical records that is attested by the Türkic substrates in the European languages. Some languages consist largely of obstructions, others are a robust mix of obstructions and sediments. Of the pre-historic processes, we can only get a glance using the imprecise measurements of archeology and genetic part of the biology, and some fragments of the early writing. Prior to the scientific archeology and genetics, the paucity of the knowledge allowed any fancy to steady and bloom, alternating between predominant and counter-culture story.

The term “cognate” in linguistics refers to the words derived from the same word in an ancestral language: if a word exists, it has to come from somewhere. Examination of cognates is the method of historical linguistics, it seemed to fit well into a Family Tree model, and is complemented by the concept of the loanwords. The model, however, drives the examination of the cognates, instead of the other way around, in the process creating discrepancies and grounds for critical re-examinations. The developing of the Sprachbund concept and the Wave model was a consequence of the misuse of the Family Tree model far outside its range of applicability. It is, however, impossible to define the range for something lacking a definition and identified by individual gut feelings. Once a hypothesis coagulates into a theory, there is a specific domain where a theory is applicable. Beyond that domain a theory is false, and a viable theory validated by its predictive capability has to supersede it.

As a means of communication, language is not tied to a location, it migrates with the people, and can superimpose on the local languages (adstrate) or, in the case of replacement of the local languages, serve as a basis (substrate), incorporating elements of the local languages. The idea of migration was already seeded in the William Jones' discovery of the kinship between classical Sanskrit and European languages. That kinship could only come with migration, hence the start of the endless Urheimat odyssey in search for a single focal point chimera. All languages that surround linguists are a result of amalgamation. A chimera of a proto-language had to be contrived like a creation mythology, it could not hail from the linguistic experience. Migrations created mixed populations that were creating their languages in a convoluted reciprocal evolution of many individual streams. They were intertwined into a continuously changing web of overlapping Sprachbunds delineated by physical or cultural barriers. Individual streams formed and dissolved randomly, feeding back on the societies and their languages in endless cycles of creation and attrition of the speech and its facets.

An ability to discern the inner skeleton of a word parsed from the barnacles enveloping it is like using an optical magnification. A word reveals its inner core, a core of a living reef enveloped by layer upon a layer of sediments leading to its present appearance. The core skeleton survives largely intact, like a fossil bone. A concept of a word root is a part of the comparative linguistics method that routinely invents new “reconstructed” roots when one is needed to fill a gap between the reality and a mystical “proto-world”. In practice, the invented IE “proto-world” is frequently a phantom clone of a real attested root from a linguistic family outside the students’ mental horizon. That is the case, for example, with the notion “stay” and its immediate derivatives, an allophone of the Türkic üstü “stay”, lit. “on feet, standing”, from the root üst “get up”, “standing on feet”, lit. “atop, aloft, upward” formed with a locative suffix -t/-d “up, upward movement”. The words “stay, stand”, etc. elided the initial vowel, hence the unattested IE phantom clone *sta- in lieu of the attested üst. Both elision and prosthemia, so regular within the bounds of the IE etymology, appear to run into a mental block outside its bounds, where mechanical deduction takes over.

A distinct source for English was the Anglo-Saxon language, it was a backbone and a main vehicle that carried substrate language to the Old English and eventually to the modern English. A good portion of the substrate language, albeit quite late in time, is recorded in the Anglo-Saxon dictionaries and grammars. Numerous lexemes that escaped a contemporary record but found their way into English without direct borrowing from the neighboring languages may have also come from the Anglo-Saxon and its siblings, and absent counterindicators thus accepted as belonging to the substrate. Anglo-Saxon was a member of a distinct cluster of languages in a loosely-knit body of mutually comprehensible dialects, fairly far removed from Gothic and other Germanic languages.

A very considerable uniquely English lexical portion appears late and apparently from nowhere, with cognates known only from Türkic languages; that type of words can also be deemed to be a part of the undocumented substrate layer. A conspicuous lexical portion is shared by many archaic languages in the area, affording a good possibility that it was also shared by the substrate of English. Since Türkic languages were developing in mobile and open societies noted for their coexistence with alien peoples, at least some of the English substrate may be re-borrowing of the local lexemes.

In the context of this compendium, the term “linguistics” refers specifically to the theory and practice of historical linguistics, a narrow branch of the entire philological field which studies lingual evidence of the past. The term “IE etymology” refers to the self-contained circular version of the historical linguistics with etymological studies focused solely on languages viscerally postulated to be Indo-European (IE). The IE linguistics is not a parcel of philology that represents the full universe of communications, it is rather an isolated parochial interpretation of a confined domain. To not appear to be built to order, linguistic reconstructions, and especially etymologies, must be based on plentiful actual material independent of the research method, they should reflect realities of time, space, and social world, and withstand critical analysis in any aspect. Unlike a science, where asserting irreproducible findings falls to ostracism, the IE practitioners are lauded when their tentative proposition looks convincing enough to be held for a mighty fact. Its not unlike any other etiology in the practice of promotional propaganda. Otherwise, etymology is a study of the sources and development of words, to find the sources wherever they may be found in our full universe.

The term “IE languages” refers to a theoretical construct that presupposes a linear evolution of a single linguistic kernel, deemed theoretical or real, into a huge branch, defined by a spectrum of intuitive perceptions of a “common knowledge”-type conceptualization. Unlike any other scientific discipline, it does not rely on its mathematical apparatus. For now, there is no definition of the “IE languages” other than the trade practitioners' personal intuitive perceptions. While some practitioners believe that the Proto-Indo-European (PIE) language existed as a formed language used by real people, others believe that it is a theoretical construct like the one time ether, useful for canvassing a model but that never existed in reality. In practice, both sides treat the hypothetical construct as a proven reality. Omission of material contravening concept is driven by confirmation bias that ignores disconfirming evidence, a propensity to seek only suitable examples whilst flouting those confounding. Excluded exceptions undermine the concept to a useless subjective speculation. The subject of the PIE and IE languages remains, historically and hence on, prone to patriotic approaches of all suits, marked by systemic disregard of annoying contrary evidence and fabrication of faux scenarios. And, the same fuzzy concept was mechanically applied to other languages, at times against all contraindications.

Genetic studies have broken the glass wall that separated and counterposed the “Indo-European” and Türkic worlds. A significant part of the Europeans turned out to be descendents of the Kurgan people, marked by the inheritance of the Y-DNA haplogroups R1a and R1b. From the standpoint of the parochial “Indo-European” linguistics, that would compel inclusion of the Türkic family into the “Indo-European” superfamily, ostensibly consistent with the contents of this work. That expansion, however, would clash with the historical reality outside of the linguistics’ control. The “Indo-European” languages became “Indo-European” during the Corded Ware period, in contact with the nomadic Kurganians, and the Kurgan culture not only precedes by far the Corded Ware period, but its migrational waves were the immediate cause for the formation of the distinct Corded Ware communities. The “Indo-European” languages were formed in the presence and under repeated influence of the Kurganians’ languages.

The Türkic linguistic map is uniquely broad in its Eurasian extent. It extends in the Southwest from Near and Middle East on the Atlantic Ocean's fringes to the Southeast, to Eastern Turkistan and on into China toward Pacific Ocean. Then it stretches to the Northeast covering Southern and Northern Siberia to the Arctic Ocean, and Northwestward across Western Siberia and Eastern Europe. The Türkic languages are fascinating with their vast geographical distribution, their contacts with many different linguistic zones, their peculiar stability over time, their morphological and syntactical regularity. Their spread represents a great number of different peoples and languages, most of them politically and scientifically reduced to a vanishing point. Currently, there are about twenty Türkic codified languages, about 42+ acknowledged languages, and about the same number of unacknowledged, amalgamated, vanishing, unexplored, and trace languages. This was actually the beginning of a Western European front against Russian Turkology that came to dominate for decades. Across Eurasia, Türkic languages lived through the periods of expansion, blossoming, migration, contraction, devastation, mutilation, genocide, prohibition, and prejudicial, cultural, political, and scientific bans. Currently, except for the lingering prejudices, most of the previous severe restrictions on studies the life of the Türkic folks have abated.

Türkic languages, from the time of M. Kashgari, are divided into Western and Eastern languages. The M. Kashgari's notion of language is not known, probably it differs significantly from the modern interpretations. Probably, his definition was a common perception based on identical typology and morphological and lexical similarities; his notion of Western and Eastern languages probably saw Kashgar as a central point. M. Kashgari could not have known of the nomadic migrations beyond his temporal and spatial horizons, nor of the Türkic languages that diverged too much to be comprehensible to the speakers of Türkic languages of his interpretation. Today, the M. Kashgari's Western languages can be roughly associated with the Oguric languages, and his Eastern languages can be roughly associated with the Oguzic languages; the h- languages of the Caspian-Aral area do not figure at all in the Oguric/Oguzic classification. The term “Common Turkic” (CT) or “Shaz Turkic” refers to one of the taxa in some classifications of the Turkic languages, in practice it is a euphemism for the Oguz-type languages. It stands in opposition to the Ogur-type “r” or the Chuvash “l” languages. Taxonomy does not cover all languages, since none of the Turkic languages escaped numerous amalgamations with linguistic varieties found across Eurasia, and the linguistic variety exceeds by far the suggested simple taxonomic criteria. Debate on the place and chronology of Oguric within Türkic continues, the differences are studied predominately through the shreds of Danube and Itil Chuvash Bulgar, without a benefit of the languages like the Hunnic that historically were connected with the Ogur languages.

However, the Western-Eastern association is period-dependent: Oguric languages in the historical time expanded from the Mesopotamian-Caspian-Aral basin to the Ordos, reverted back to the Caspian-Aral basin, and expanded westward to the Eastern and northwestern Europe. Nomadic communities used to leapfrog over occupied territories, and spread over remote defenseless sedentary populations, thus the once western languages could become extreme eastern languages, vice-versa, and anything in-between. M. Kashgari could not have known of the Oguric languages much outside of the Moslem world. Similar swings happened to the M. Kashgari's Eastern languages that during the historical time traversed Eurasia from Mongolia to Balkans and beyond. The genetic pallets of the Türkic ethnicities, drawn starting from the last decades of the 20th c., provide a good illustration of the admixtures that compose each ethnicity. Genetic pallets of the nation-states and nations united within a common linguistic family also provide a good graphic illustration of the fallacy of the Family Tree models.

In the context of this compendium, the term Western Türkic languages refers to the pre-historic languages of the Kurgan western waves plus the Western languages of M. Kashgari, and the term Eastern Türkic languages refers to the pre-historic languages of the Kurgan eastern waves plus the Eastern languages of M. Kashgari. Under that hazy definition, the Türkic language dictionaries compiled in the Muslim world largely cover the Eastern Türkic languages, with accidental elements of the Western Türkic languages spilled to the Muslim areas. The multi-ethnic composition of all Türkic confederations, always based on a system of marital unions between non-blood related and largely alien members, precluded formation of a single common language that can be identified with a single ethnic or political entity; any lingua franca of any confederation inevitably was a blend of few, at times very diverse, languages.

Although the first written evidence of the Türkic languages comes from Mesopotamia (5th-3rd mill BC), archeological evidence shows that Mesopotamia was a fringe finger much to the south of the bulk of the eneolithic Kurgan people, whose expanse of the Khvalynsk and Sredny Stog cultures (5th mill. BC) were focal points for the coming expansions. The first written attestations came with the very first inscriptions in the history of the mankind, the Sumerian and Türkic yer, English “earth”; the Sumerian sig-, Türkic sok, English “sock” (beat); the Sumerian akka, Türkic aga, English “age”, and numerous others. Such cognates have not deserved the attention of those who declared Sumerian to be a “language isolate”. Not the typology and grammar instigated the assertion, since European philology holds those as malleable. These cognates testify that not only the Sumerian was a flourishing amalgamation of the Türkic and a local language, but that it is still an essential component of the undoubtedly agile modern Türkic and English languages. Türkic vernaculars were settling in the Middle East for about 6 thousand years with each influx of the mounted nomads, at least 3 thousand years before the Indo-Aryan arrival there ca 1500 BC, and many times after that. The Türkic component found its way in all three stages of the Persian language, contributing to its complexity as a blend of primarily local, overlaying migrant vernaculars, and multi-stage vernacular amalgamations, and to the drastic changes between stages. With such complicated history, outside of the late Islamic innovations, any presumption asserted in attributing to the Persian language as an ultimate source is jeopardizing credibility. Etymological data shows way less Persianisms in the Old Türkic (10th c.) than the Turkisms in the Persian. Historical processes outside of the contemporary Moslem world nearly completely escaped attention of M. Kashgari.

M.Kashgari AD 1072
Map of the World

Language abbreviations

Alb. Albanian   Fin. Finnish   Latv. Latvian   Rum. Rumanian  
Ang. Anglian   Flem. Flemish         Rus. Russian  
Ar. Arabic                    
Arm. Armenian 811 Fr. French   Lith. Lithuanian   Sax. Saxon  
A.-Sax. Anglo-Saxon   Fris. Frisian   Lat. Latin   Scand. Scandinavia  
Av. Avesta   Gael. Gaelic 13 Luz. Luzian   Serb. Serbian  
Az. Azeri   Gaul. Gaulic 6 M Middle   Skt. Sanskrit  
Balt. Baltic   Gk. Greek   MHG Middle High German   Sl. Slavic 158
Beng. Bengal   Gmc. Germanic   MLG Middle Low German   Sloven. Slovenian  
Blr. Belarusian   Gmn. German   MM Middle Mongol   Slvt. Slovak  
Boh. Bohemian   Goth. Gothic   Mod. modern   Sp. Spanish  
Bosn. Bosnian   Gujr. Gujrat   Mong. Mongol   Sum. Sumerian  
Bulg. Bulgarian Slavic   Hebr. Hebrew   N North   Sw. Swedish  
Cat. Catalonian   Hitt. Hittite   Norw. Norwegian   Tat. Tatar  
Ch. Chinese 60 Hu. Hungarian   O Old   Taj. Tajik  
Chuv. Chuvash   Icl. Icelandic   OCS Old Church Slavonic   Tr. Türkic  
Cimr. Cimbrian   IE Indo-European   OE Old English (Anglo-Saxon)   Turk. Turkish  
Croat. Croatian   Ir. Irish 146 OHG Old High German 244 Turkm. Turkmen  
CT Common Türkic   It. Italian   OT Old Türkic   Tuv. Tuvinian  
Dag. Dagur   Khak. Khakass   P Proto-   Ukr. Ukrainian  
Dan. Danish   Khal. Khalka Mongol   Pers. Persian   V vulgar  
Du Dutch   Kirg. Kirgiz   Phryg. Phrygian   W West  
Eng. English   Kor. Korean   Pol. Polish   Welsh Welsh 118
Est. Estonian   L Late   Pruss. Prussian   Yak. Sakha 5


Archeological and genetic works demonstrated migrations, amalgamations, and replacement of populations in the Western Europe, where the Germanic branch of the Indo-European (IE) languages occupies a prominent place. Linguistic works demonstrated that Germanic branch contains a substantial layer of non-Indo-European substrate. The English language is a prominent member of the Germanic branch. The sources of the Germanic substrate remain debatable, with numerous candidates explored and rejected. With the insights provided by archeology and genetic, and based on their converging contention that from the time of the population replacement in the 3rd millennium BC until the middle of the 1st millennium BC, the Türkic (Proto-Türkic) linguistic field dominated the whole Eurasia reaching the Atlantic Ocean on one end and Pacific Ocean on another end, a concept was formulated and substantiated that the non-Indo-European substrate of the Germanic branch was rooted in the Türkic (Proto-Türkic) linguistic field. The groundwork for this linguistic concept has already been established, the concept is a necessary corollary of the positively attested migratory flows. The concept explores the Türkic–English morphological and lexical correspondences, and finds substantial traces of the Türkic substrate in English, potentially exceeding 30-40% of the English words used in the daily life. Of the English suffixes, 63% descend from the Türkic origin and remain morphologically active in forming English words. The concept touches on the substantial trace of the Türkic–Latin–English correspondences, linguistically corroborating the thesis that the Kurgans' circum-Mediterranean path via the Pyrenees to the Continental Europe brought about the Beaker Culture, ancestral to the Pra-Celts and Pra-Italics. The Beaker Culture's language in terms of the Celtic archeology is called Celtic, but is meant to name that ancestral language.

The results of the study corroborate archeological and genetic conclusions, on the example of the English and Latin languages providing a salient amount of linguistic evidence in their favor. The results introduce solutions for lingering questions, raise questions about adopted dogmas, and open gates for multi-discipline studies of the questions raised. Far from exhaustive treatment of the subject, the study attempts to follow a holistic view on the Eurasian history with languages being one of the major human attributes where the past instigated the present. It explores a portion of relevant information where neglected evidence is disproportionally substantial in relation to the attention it has been receiving. The study leaves much material unexplored, it only extends to a degree sufficient to substantiate the proposed concept and illustrate stochastic nature of the past linguistic dynamics. In the world where observation, not preconception, describes the reality, preconception is fleeting. The study draws attention to the perspective research in this field in a diachronic typological , which, in my opinion, may shed light on reconstruction of the linguistic prehistory of Eurasia.

The concept fits in well with the theoretical and experimental facts. It fits well with the fact that nobody could ever substantiate the location of the IE Urheimat, the area where the IE languages were born, that people had thought existed. It fits well with the assertion that the Scytho-Iranian Theory was a poorly executed hoax concocted for political ends at a certain apex of the late colonial mentality. It fits well with five dozens or so counterindicators that demonstrate impossibility of the said Theory. Some counterindicators are as weighty as the genetic conflict between the predominantly Kurgans’ R1b Y-DNA haplogroup and the predominantly Iranian J2 haplogroup, the biological conflict between the milk-based diet of Kurgans and the general lactose intolerance of the Iranians, and the conflicts of the literary, ethnologic, linguistic, and ethnic appellations’ nature. It fits well with the archeological conclusions on the Kurgan migrations, with the genetic dating of the Kurgan migratory paths, with the genetic composition of the North-Western European population, with the existence of the substrate language underlying Germanic languages, with the European geography of the Classical period, with the abundance of Turkisms in the Anglo-Saxon, Gothic, Latin, and other documented languages of the Classical and Late Classical period. The predictive capability is well beyond serendipity at a random examination of Proto-Germanic *prototypes where the IE etymology stops dead at the *PG model. Random experiments with random sets of suspected Turkisms in English found that in about one-half of the cases the suspicion was justified, depending on physical or abstract notion, by direct or readily apparent metaphoric concordance. Moreover, in many cases both literal and metaphorical semantics of a word constitute instances of paradigmatic transfer. Random statistical testing demonstrates the reality of the Türkic-Germanic link. Finally, the concept is supported by the incompatibly potent explanatory capacity, and a viable predictive capability for minor and crucial indicators.

A number of the Scytho-Iranian Theory counterindicators have a predictive capability, they have already accurately forecasted the future genetic findings before they became confirmed by facts. They have already accurately forecasted the salient presence of Turkisms in the archaic European dictionaries. They open a gate for future discoveries. Combined altogether, the various facets of the concept are weighty enough to inspire confidence that the concept is well substantiated.


Attention to the Germanic-Türkic commonalities must have arisen many times over on various levels. Probably, it far predates the discovery of the IE connections. Over the centuries, many word lists may have been produced impromptu, and they circulated in the Internet Age, primarily for the German language due to historical exposure. Typically, a closer look at such listings related to English produces a high degree of etymological misfires, the remaining small minority tend to be repetitive, inevitably inclusive of a standard complement of obvious clones, mostly nouns: ace, dawn, do, earth, guest, eat, say, son, and the like. On the other hand, about every tenth word turns out not to be a false lead, and a few more appear plausible but on inspection lacking a preponderance of evidence.

E.Forrer (1934) raised to a scientific discussion the observation that Germanic languages have a non-Indo-European substrate. Before that, the linguistic terrain belonged exclusively to the IE studies, based solely on the Family Tree model. All apples were falling at the root of the mama tree, and all saplings were its kids. The concept of substrate had infringed on that idyllic cartoon. Not only the mamma could grow in a distant banana grove, but the saplings could be various kinds of hybrids. The IE Theory was incessantly criticized and adjusted since its inception two centuries ago, and it is still lacking its fundamentals, such as a basic definition of what is and what is not Indo-European. An archaic notion that at some time there existed the PIE Adam and Eve speaking PIE language has been shelved long ago, pulling the rug from under a stipulation that at some time, somewhere, existed a community that used to speak the PIE language, that the PIE is not a chimera. An evidence that PIE was not a compact language, and even not a language at all, pops up everywhere. Either lexically, phonologically, or morphologically, the IE Family Tree is perplexing.

Perpetuation of the obvious etymological manipulations in support of the IE Theory has an opposite effect of seeding discontent. To account for the contraventions, the model has to be disfigured with areal and contact interpretations fused into a single-dimension paradigm, with optative inspiration that a three-dimensional amalgamation of Family Tree and a Wave models may putatively address the nature and distribution of the correlations found among the IE languages. A few core languages provide well attested IE correlations, while the optimistic attribution of other languages to the family rests on statistically insignificant spotty evidence carried by mounds of interpretations. Aside from notoriously circular argumentation, the Family Tree model is bound with fluid interpretations of undefined parameters. Beyond a fuzzy notion that a majority (dubbed “consensus”) of some particular scholar community should be convinced on attribution of a language to a particular linguistic group, no defined criteria requires to meet any evidentiary specifications neither for pro nor for the con position. Conclusions are driven more by embedded ideological trends than by analysis. In the end, the PIE comes out as a clash of beliefs, rather than a clash of evidence. The problem remains a tag-of-war between competing opinions of the IE linguists.

Genetic evidence shows a massive demographic imprint of the R1b haplogroup in the Western Europe and in the Inner Asia. In Western Europe this demographic imprint is connected with the Germanic languages, in Inner Asia it is connected with the Türkic languages. Another unique for the mankind common biological adaptation is the lactose tolerance that could only germinate and survive in a pastoral economy dependent on dairy products for subsisting and successful reproduction. Biological connection is independent of nationalistic notions, ideological biases, religious affiliations, dress codes, and other curses of the New Era civilizations. Instead, that connection carries common nomadic societal traits atypical for the Classical Europe: respect for individual liberties, autonomous parliamentary organization, respect for women, and spirit of mobility. Logics bears to seek demographic commonalities in the fields beyond the innate biological and societal domains. Language, being a major human demographic attribute, can't escape from demographicly carrying its uniqueness form generation to generation, and across all kinds of societal and demographic perturbations. As objects of cultural exchange, linguistic elements permeate incompatible languages, living linguistic clues everywhere. Inescapably, linguistic elements are passed along across generations, millenniums and populations. Unlike the strands of DNA revealing the story of human migrations, theoretic linguistic strands live their own life of abstruse and denial, obfuscating rather than attesting linguistic flows across our small globe. In the surrounding richly amalgamated world, linguistics of the past had created its own closet world of linear surrealism little permeable to the rest. Linguistic clues sloth about, unneeded to divine past wanderlust behind sundry present.

An unexpected development came from the genetic studies that confirmed nearly complete wipe-out of the “Old Europe” population by the 3rd mill. BC, and its replacement by the mounted Kurgan nomads, long stipulated by the archeologists. A corollary of the population replacement by the waves of the pastoral Kurgans is that the Türkic languages of the Kurgans replaced, penetrated, and amalgamated with various European languages still in the 3rd mill. BC. In one form or another, the Türkic languages dominated most of the Europe as lingua franca, although demographically, the pastoral populations are always sparse. The “Old Europe” populations found a refuge in the Eastern Europe, from where in the 2nd mill. BC their descendants migrated to the South-Central Asia, and in the 1st mill. BC their other descendants bounced back to the Western Europe. The IE migration to the South-Central Asia from the Eastern Europe is reflected in the diminished IE element in comparison with the Germanic languages (Prokosch 1939). After some wild excursions in space and time, the PIE theory is about to curdle down to the N.Pontic geographically and 3,500 BC temporally. Conceptually, that puts it right next to, and thus intersperced, with the real Kurganians, and enables it to at least discursively to accept the iniquity of amalgamation.

Domestication of animals for food was a first step in liberating people from the burden of daily provisioning. Domestication of horses vastly increased that benefit, it made provisioning portable. A feedback was revolutionary. It initiated a self-perpetuating technological revolution that extends to our days. Pastoralism sparked a mass of inventions that crucially changed the human society. Just the invention of a portable home and a wheeled mobile home required a creation of a technological complex spanning numerous engineering fields. Pastoralism freed societies from the constraints of immobility and torpidity. It required daily ingenuity, cultural openness, flexibility, and providence. It removed constraints of climate, distance, areal containment, and stewing in its own juice. It opened a path to flourishing societies, to continent-wide cultural and technological exchange. It shared its knowledge far and wide with the sedentary neighbors, both of agricultural and hunting economies. That did not get unnoticed, news of the pleiad of the nomadic teachers is found in classical literature of literal nations, Greek, Indian, Chinese, etc. Nomadic societies were on the move and evolving, and facilitated explosive evolution among their static neighbors. They either spawned or spread totally unforeseen novelties, like parliamentarism, religion, and personal freedom.

Genetics corroborates independently migratory flows marked by distinct archeological traces. Their corollary defines linguistic situation in the Eastern and Western Europe in the course of the 5th – 1st mill. BC. Earlier migrations set up conditions for the following migrations preceding to and during the period of the Great Migration of People in the 1st mill. AD. Genetics helped to clarify the phenomenon of the Celtic migration, it corroborated archeological understanding of the Celts coming from Africa to Iberia at about 2800 BC, and traced their migration in a circum-Mediterranean movement to its source in the Eastern Europe of the 6th-5th mill. BC. Some linguistic elements, shared by the Eastern European languages in the 6th-5th mill. BC, survived both the overland and circum-Mediterranean movements of the Kurgans, and along with the later migrations and local vernaculars, they formed the Germanic substrate now found as linguistic vestiges.

Among potential Germanic substrate donors were suggested Fennic, Uralic (Wiik, 2002), Semitic (Vennemann, 2003), Tyrsenian (Steinbauer, 1999), but due to the episodic nature of the linguistic parallels, none of them gained an acceptance. Consensus remains with the S.Feist's assessment of about one third of the Proto-Germanic lexemes originating from a non-Indo-European substrate, and that the Germanic languages were a result of pidginization and creolization or koineization of that substrate with the later adstrate. Pidgins are simplified conflated languages with trimmed grammar and limited vocabulary; a pidginated language is stripped down to the roots of nouns, verbs, and adjectives, with vestigial morphology and inflection, a fluid word order with prime accent on action, and a use of uncoupled vestigial elements as modal semantic determinants. Within a single or few generations pidgins develop into viable creole languages with fairly fixed syntax, grammar, morphology, and syntactic clauses. Any semantic gaps are filled in with local innovations that replace the lost conventions of the parent languages. A pidgin stage is an innate part of the human development starting from the moment of birth. The pidgin processes are fairly uniform. The most rudimentary SVO syntax, not burdened with convoluted conventions, tends to tarry on in the formed language. Here again, the northwestern Baltic, Slavic and Germanic languages differ by inconsistent syntactic preference attributable to contacts with SOV-type languages, in our case specifically with the Türkic languages of the steppe Kurgans.

The spread of the European languages during the era of the global colonization had created numerous pidgin islands across the globe, spreading Indo-European languages to all kinds of natives, and promoting Indo-European languages to become a predominant linguistic family of the entire world. Nowadays, computers and English, with a twist of exploding literacy, do the same, creating vernacular English creoles across the globe. The process, started by the first waves of the mounted Kurgans to the Europe and Far East, continues unabated.

Based on the combination of the archeological, genetic, and linguistic indicators, the present concept was formulated and substantiated that the substrate of the Germanic languages was or were languages of the Türkic linguistic family, whose male speakers had a frequent marker of the haplogroup R1b, and who amalgamated with the local populations marked by the predominant male haplogroups I and N. The haplogroup I is identified with the native European populations, in particular the Balkans and Scandinavia, noted for their sedentary lifestyle, cereal farming, and military vulnerability. The haplogroup N is identified with the northern Eurasian Fennic populations, noted for their sedentary lifestyle, hunter-gatherer economy, sparseness, and military vulnerability. The haplogroup R1b is identified with the Kurgan cultures, noted for their horse nomadic husbandry, high mobility, and high military aptitude.

The proposed concept disbands unsustainable IE etymologies, analyzes and explains the peculiar geographical linguistic distribution stretching from Northwest India up to the Pontic Steppe and on to the North Sea, corroborates archeological and genetic evidence of migrations and amalgamations, is in concordance with the documented history of the Eurasian Steppe and Northern European people, is in concert with historically attested ethnological and cultural distinctions, and is constructively helping to rid the swollen IE paradigm from unrelated burdens. It allows to discern particular groups and distinctions that contributed to the evolution of the English language. It uses hard statistical probability in lieu of warm linguistic feelings to discern correspondences from similarities, disbanding speculative justifications and accounting for wide distribution of similarities across Eurasia to the significant exclusion of the Near East that muddles the IE paradigm. It demonstrates partial adoption of an entire lexical, morphological and script paradigms that can be accounted for only through borrowing, attesting to demographic fusion. The material that forms the concept is not amenable to a flawless reverse interpretation. Impervious to the evidence and testimony the faux etymologies fall flat. The circular logics, endemic in general linguistics and particularly universal in the IE methodology, is eschewed as counterproductive. It demonstrates futility of limiting the field to poorly substantiated and arbitrarily applied phonetic processes, to a reflexive exclusion of all other indicators. The concept illustrates mechanisms of the typological change, replacing habitually postulated miraculous conversions with demographically-driven and component-specific mechanisms, like a pidginization leading to a linguistic analyticity, Cf. English and Bulgar among the European IE languages. The concept strives to avoid elastic definitions drafted to include diverse material, it seeks to rely on attested evidence in lieu of theoretical concoctions. It allows to place the few core languages in a historical perspective, stripping them of a special ancestral status without throwing the baby with the water.

The converging genetic dating allowed to trace genetic markers in space and time, and draw observations about their migration, spread, and timing. According to Klyosov 2010, “The modern Uigurs, Kazakhs, Bashkirs, and some other peoples of Siberia, Central Asia and the Urals descend in part from the ancient R1b1 branch, and by now retain the same haplogroup for 16,000 years. The “Türkic-lingual” haplogroup R1b expanded from the South Siberia, where it formed 16,000 years ago, across the territories of the Middle Volga, Samara, Khvalynsk (in the middle course of river Volga) and the Ancient Pit Grave (“Kurgan”) archaeological cultures and historical-cultural complexes (8-6 thousand years ago and later, the common ancestor of the ethnic Russians with the haplogroup R1b1 lived 6,775 ± 830 years ago), northern Kazakhstan (for example Botai culture dated by the archaeologists 5,700 - 5,100 years before present (BP), in reality much older), passed through the Caucasus to Anatolia (6,000 ± 800 BP by the dating of R1b1b2 haplogroup of the modern Caucasians), and through the Middle East (Lebanon, 5,300 ± 700 BP; the ancient ancestors of the modern Jews, 5,150 ± 620 BP), and Northern Africa (Berbers of the R1b haplogroup, 3,875 ± 670 BP), crossed over to the Iberian Peninsula (around 4,800-4,500 BP, present day Basques 3625 ± 370 BP) and further on to the British Isles (in the Ireland 3,800 ± 380 and 3,350 ± 360 BP for different populations), and to the continental Europe (Flanders, 4,150 ± 500 BP, Sweden 4,225 ± 520 BP).” The trace is seen as linear, while its object is turbulent, consisting of eddies and vortices of local and global dimensions. The exponential increase in the accumulated quality and quantity of the available genetic analyses tends to advance precision of the early dating, adding sharpness and geographical detail to the temporal assessment without substantial challenge to its topography.

According to the archeological evidence systemized by M. Gimbutas, 1994, Europe also experienced three major Kurgan overland migration waves, some of them were repeat migrations into the same areas. The dating of the Kurgan migration waves, produced by archeologists using radiocarbon analysis, is in concert with the genetic dating: wave 1 at c. 4400-4300 BC, wave 2 at c. 3500 BC or somewhat later, and wave 3 soon after 3000 BC; the circum-Mediterranean Celtic Kurgan wave reached Europe independently at 2800 BC. Along its route, the circum-Mediterranean wave remains archeologically unexplored. Between the 3000 BC wave and Sarmatian migrations of the 2nd c. BC, there is a historiographical lacuna, but considering the sequential waves of the Huns, Bulgars, Avars, Kangars-Bechens, and Oguzes of the 1st mill. AD, there is no reason to suspect an absence of the Kurgan migrations during the lacuna period. It is reasonably expected that the waves, separated by the timespans on the order of millenniums, were likely composed of linguistically differing tribes of the same linguistic family but complemented by different allies, were impacted by the specifics of their migration routes and their durations, and were bringing to the new territories their particularly distinct vernaculars. Although belonging to the same nomadic horse-breeding Kurgan historical-cultural complex, they possessed different technologies, starting with the Neolithic, and ending with the metals.

The notion of the Türkic languages underlying European languages is nothing new. At the time preceding the emergence of the Scytho-Iranian Theory by two and a half centuries, as far back as 1653, M. Van Boxhorn suggested that a Scythian language was the Family Tree's proto-language of Baltic, Celtic, Germanic, Greek, Iranian, Romance, and Slavic languages. At the time, in the 17th century, following the heritage from the Antique period, the old collective name Scythians was vaguely synonymous with the Türkic people, at the time known as undifferentiated Turks or Tatars. The vagueness of the term was predicated by the vagueness of the European knowledge about people further east, and especially the pastoral nomads; practically nothing was known of the pre-Islamic northern Eurasians, their kurgans, their cultures and scripts. The Van Boxhorn's suggestion was numerously modified by later scholars as new studies filled in the scholarly gaps, and survived in the crevices left out by the rise of the nation-states with their reinvented and politicized histories. According to the Türkic substrate concept, Van Boxhorn's suggestion is solidly corroborated, especially in respect to the English language, and to the Germanic languages in general. Some other European languages do not lag too far behind, each with its own historical and linguistic peculiarity.

The breakthrough afforded by the genetics helped in dislodging interpretations based on simplistic presumptions, it allowed to correlate migration of the genetic markers with migration of certain archeological cultures and people. The R1a and R1b haplogroup markers were felled from their “European” pedestal, they were found to originate in the South Siberia - Northern India area, and being dispersed west and east in a sequence of numerous migrations. Paleogenetic studies allow to peek into genetic composition of long-gone cultures and people, either bridging the past and present or demonstrating a demographic disconnect between them That is best illustrated by the western European peninsula, where the modern predominance of the marker R1b contrasts with the paleogenetic predominance of the markers I and G. The gullible presumption of the genetic continuity was felled, the realization that the Western Europe is largely populated by the Asian migrants has taken over. After many migrational waves, the marker R1b has survived and blossomed, while their companion markers R, R1, R1a, and many others, present in the past migration flows, have faded.

That does not negate their value in the genetic studies, not only to better describe the past populations, but also to cross-correlate genetic dating for the populations' samples. An asymmetry of the male Y-DNA and female mt-DNA markers, with fast-changing Y-DNA and slow-changing mt-DNA, may fairly well reflect a demographic picture (changing males and stable local females), but provide a skewed linguistic picture, with languages propagated on the maternal side playing a relatively larger role than the ones propagated on the paternal side. These correlations affect dissemination of linguistic traits, and our perceptions of the mechanics of the linguistic development.

Reverse tracing the Indo-European languages, before their partial migration from the N.Pontic to the South-Central Asia in the 2nd mill. BC, and a back migration to the Western Europe in the 1st mill. BC, tells that the bulk of the linguistically European population lived in the Eastern European refuge, with dispersed linguistic isles of the “Old Europe” surviving across Western Europe and with a considerable isle in Scandinavia. The European refugees of the “Old Europe” camped in the Eastern Europe from the beginning of the 3rd mill. BC. Before that, they were the “Old Europeans” and the future Indo-Europeans of the Western and Central Europe, they were farmers, and many of them amalgamated with the pastoral people of the Kurgan wave 1. The linguistic traces of the “Old Europeans” are still with us, the Balto-Slavic and Indo-Aryan/Sanskrit daughter languages.

The Kurgan wave 1 of the mid 5th mill. BC was neither peaceful, nor destructive, it replaced the rustic social organization of the “Old Europe” with culturally alien elite, with demographic and linguistic situation largely left intact. It was the old Neolithic “Old Europe” under a new management. The new world order lasted for 15 centuries, or 60 generations, till the destructive Kurgan wave 2. Before the Kurgan wave 1, the Neolithic “Old Europe” was peaceful and happy for millennia, from the time of the introduction and spread of farming, which supplanted the rare scattered hunter-gatherers and greatly affected their demography by boosting their fecundity. The few members of the “Old Europe”, excavated from the Central European “killing fields”, belonged to the Y-DNA Hg I and G. The genetic picture is regularly improved by new paleogenetic data.

Conceptual map of Kurgan westward waves with datable genetic markers (arrows)
(background R.R.Sokal et al. 1992 and M. Gimbutas 1994)

This detailed genetic picture provides an impetus for linguistic studies. Presently, English is credited with approximately 800 Türkic cultural loanwords of mostly medieval Ottoman and Kipchak origin (Bikkinin I., 1998), of them about 250 are found in common English dictionaries, and are listed in the Wikipedia “List of English words of Turkic origin”. According to the Türkic substrate concept, a deeper linguistic layer forms the substrate layer of the Germanic languages, and particularly of the English language. The fresh approach would not change the body of the factual evidence, it would bring out new connections instead of trying to trod the same path and expect different results. Fresh approach either challenges the standing theories and hypotheses, or confirms them till an emergence of a new challenge.

For the sedentary societies, the Kurgan expansion and population replacement, attested by the “killing fields” of the period between 4500 and 4000 ybp, would overlay a continuous chain of mutually incomprehensible vernaculars every 200 km. This value would not apply to the mobile nomadic society, where the linkages are much longer and alien encounters are much more frequent, resulting in more pronounced effect of linguistic leveling. Still, with the longitudinal distance of 55°, as depicted on the Conceptual map of Kurgan westward waves above, and the timespan of 2,500 years, the development of local Sprachbunds is unavoidable. Allowing a theoretical 5-fold increase in the linear spacing would divide the European theater into 10 conceptual Sprachbund areas, 5 areas wide and 2 areas across. Given the relative stability of the roots in the agglutinative languages, the interplay between these European Türkic Sprachbunds and later historical events that shaped various European languages would create a raster of allophones for each word, at times barely recognizable, united by their origin from a small group of relatively close vernaculars.

The last Kurgan waves belong to the Iron Age, they are connected, in sequence, with the last northward movements of the Scythians coinciding with the formation and rise of the Roman Empire and with Kurgan migration from the Central Asia to the Urals. The Uralian Sarmat men were supplanted by the migrants, who inaugurated the Late Sarmatian Age. Retreating Sarmats overrun the N. Pontic Scythians, and expanded into the Central Europe, turning it into European Sarmatia described by Ptolemy and Strabo. In the process, Europe gained theretofore unknown nomadic people called Wendeln and Goths, Burgunds and Turings, and others that pushed out the old Scythian nomadic tribes into inaccessible Netherlands and Jutland. With the newcomers came their vernaculars, mixing, and amalgamation.

The next Kurgan wave is known under the name of the Huns, although its bulk consisted of the Early and Late Sarmatians known under variety of names. With the Hunnic wave came more vernaculars, mixing, and amalgamation. From the Hunnic wave arose Anglo-Saxons, history becomes much less blurred, and we receive the first records on the languages of amalgamated nomadic peoples, in case of Anglo-Saxons and Goths a blend of recognizably Türkic and unrecognized local European languages that become known as Germanic languages. From the same milieu of the Türkic and local vernaculars rose the Slavic languages.

The next Kurgan wave was already termed Türkic, it brought along Avar, Bulgar, and Suvar Türks, and numerous tribal names, but their western influence did not extend much beyond the Slavic linguistic area. All subsequent Kurgan waves, those of the Bechens (aka Pecheneg), Oguzes (aka Torks and Türks), and Kipchaks (aka Polovetses) did not extend beyond Balkans. The last Kurgans that reached central Europe were a branch of Bulgar Onogurs with their allied Ugro-Finnic allies who took over Pannonia and formed the Magna Hungaria. The last Kurgan wave was that of the Chingizids, it was stopped at Adriatic and barely affected the Balkans. The Hunnic wave was the last one connected with the Türkic substrate of the English language.

The eventful life of the Kurgans shaped their languages. According to Turkologist S.E. Malov 1952, “Western Türkic languages show that they had very rich and long life, they experienced many different influences and other exposures. That could not have happened in a very short period. In the west, all settling of Türkic people from Central Asia that we know of (for example, the Huns, Mongols and Tatars, Kyrgyz) did not exert influence and shift toward Eastern Türkic linguistic elements that could be expected if here in the West have not been established the steady and well-ossified Western Türkic languages”. The western traces of the Türkic languages ​​are deep and wide, so deep that they continue their active and productive life in the languages across much of the Europe, so wide that they could not have been wiped out by the pre-Industrial Age extermination campaigns. The traces are neither erasable nor untestable.

Long before the horse nomadic Scythians brought their kurgans from Siberia to the N. Pontic (ca 8th c. BC), kurgans were already ubiquitous across much of the Europe. The first kurgans came to Europe with the wave 1 of the pastoral horse-riders ca 44 c. BC, by that time the Kurgan Culture of the Eastern Europe was about 3 millennia old. Wave 1 established a European forepost, wave 2 fortified the revolving east-west links, and the wave 3 drastically changed the European demography and culture. In turn, Europe was changing the nomads. A major factor was the transition from the severe continental climate to the oceanic climate, with its mild winters and cool summers. Instead of subsisting on under-the-snow winter forage, herds had a continuous round-the-year access to fresh grass. Sedentary hunter-gatherer and farming populations were defenseless against the mounted nomads, they could easily be levied for labor-intensive supplies. The long-range meridional and mountain elevation migrations, and migrations to the winter refuge became unnecessary. Herds of horses could remain in situ, and be complemented by incapable of the long-range migrations cows and sheep. Compared with horses, cows and sheep produced an abundance of milk and meat. The technology of mobile homes waned in favor of the permanent settlements. An abundance of waterways reduced the need for overland travel that faded to became a secondary option. The extent of the year-around pastures allowed to bring pastoralism into the theretofore inhospitable latitudes, particularly to Scandinavia and the Isles with their pastoral economies surviving to this day. Economic adaptations turned once perilous and isolated pasturing routs into communal ventures that involved diverse ethnicities and social strata. They were a medium which grew cultural, syncretistic, linguistic, etiological, social, and technological adaptations. The hallmark of the adaptations was expressed by the Central Europe's regional Tumulus culture (Gmn. Hügelgräberkulturca, 16th - 12th cc. BC), another name for the Kurgan culture, from the Türkic tumlu “tomb”. Germanic and other European tribes were “Huns” long before the coming of the Western Huns to Europe in the 4th c. AD.

After some unknown prehistorical events, ca 8th c. BC, masses of the European nomads migrated northward with their minions, to the Scandinavian area. Climate-wise, the area afforded the same favorable conditions, but otherwise it was an area reliably secured from attacks by the wide bodies of water. That probably was a reason for migrations. The following two thousand years saw plenty of demographic activity, little known initially, but well documented for the later literate period. The events of that period attest to all kinds of amalgamations, diffusions, and divergency. There are indications that on the way, Denmark accepted Cimmerian and Scythian refugees who established their own enclaves behind the safety of marches and water barriers. Ethnically distinct, these nomadic migrants had enough commonalities with the Scandinavian pastoralists to coexist and amalgamate. These processes are fairly well known, up to the specific name of Ases of the Norse sagas, but lightly studied for historical reasons, the nomadic pastoral culture fell by wayside in comparison with unduly glorified culture of the farming population and the Viking glory. On one hand, the skewed science brings skewed results, but on the other hand it leaves plenty of room for historical explorations that would right aberrations in many involved disciplines, not the least, and probably most heavily, in linguistics.

Available dictionaries cover Anglo-Saxon and Old Saxon from the 7th c., the Old German from the 8th c., the Old Frisian from the 13th c. copies of the older material, and the Jute that combined Jute and Cimbri (the Classical Cimmerians) from the 13th c. The Angles, Saxes, and Jutes were the most important initial contributors to the development of the English. The Anglo-Saxon lexis remains a solid backbone of the English, it formed in the Central Europe before the 5th c. migrations to the British Isle. English was greatly transformed in two periods of fast and cardinal changes, the first in the 11th c., the second in the 15th c. The first transition from the Anglo-Saxon or Old English to Middle English was connected with the Norman conquest, the second transition from the Middle English to New English was technologically driven by the introduction of typography.

English has estimated 500,000 words, absorbed from every imaginable language; about 30% of the lexicon ascend to the Germanic, Celtic, and substrate portions; the unique words shared by English and individual Germanic languages serve as indicators of the colorful blend: 120 Anglo-Frisian words vs. 40 Friso-Scandinavian words. Thus, for example, the particular form sin “sinful” came from the Türkic exclusively via Frisian. Only a portion of the Türkic lexicon in the Anglo-Saxon vocabulary survived into English, numerous Anglo-Saxon Türkic words are absent in English, but at the same time English is endowed with numerous Türkic words not documented in the Anglo-Saxon lexicon.

The layer of the Türkic origin may number only few hundred words, but they are the most important words: I, do, this, my, make, give, talk, eat, write, tell, kill, earth, time, day, dawn, body, and the little suffixes that make English the English. They are also the most necessary and endearing household words: mommy, papa, daddy, baby, puppy, doll, lullaby, cry, hash, wake, fart, butt, son, girl, brother, cousin, kin, guest, say, tell, candle, loaf. In the G.Doerfer's classification, these words are “essential basic words”, essential for the daily life and for the substrate concept. Of the Swadesh list of 207 “basic” words, about 62 words or exactly 30% correspond to this study’s entries on the 800-word listing of the English Turkisms. Pointedly, the study did not pursue a targeted examination of the Swadesh-207 list, and the results are random coincidences between two independent tabulations. That allows to predict that a complete examination would produce a number somewhat closer to 40%. In reality, since the Swadesh list is tailored to the specifics of the English with its absence of locative suffixes and similar inflectional markers, the English words indicating inflection can’t be classed as“basic words contained in any language”, making the Swadesh-207 list somewhat smaller, and even in this random examination the actual proportion of the 62 words stands higher than the pedantic 30%.

So far, most of the Türkic words meet etymologists' blank eye, many appear from nowhere in the records of the late Middle Ages as a “folk speech”, which what they precisely were, the speech of the ingenious English folk little affected by the Lat. influence. Quite remarkably, some of these basic words echo in the Chinese and Korean, demonstrating their spread from one end of the Eurasia to another, which only the horse-mounted Kurgan nomads could feasibly achieve. That common thread once propelled, under a spell of the Family Tree model, a suggestion of Sino-Caucasian superfamily.

A study of English Turkisms may be helpful in understanding development of the Türkic languages. The portion of the Turkisms brought to English by the Anglo-Saxons and other concomitant ethnicities may illuminate semantic nuances and extensions that did not survive to the 13th c. late records for the eastern languages. For example, the Anglo-Saxon word sink in the compound naegledsink “studded vessel”, lit. “nail-adorned vessel” uses the word sink for a an object translated as “vessel”, but could be a funeral casket made of a watercraft, or a watercraft used in funerals, or a casket patterned as a watercraft, while the recorded Türkic stem siŋ-/sı:n carries notions of submergence into water instead of floating on the water, a sink basin, and a tomb, i.e. the objects that can't be imagined studded. The older records of the Anglo-Saxon language may also elucidate the apparently unconnected derivatives and concrete nouns. Some basic Turkisms came to us practically unchanged, they may be as old as the parental haplogroup NOP, Cf. earth, don, tell, sink, sail. Other basic words stratified into clusters of barely compatible allophones with very different degrees of recognisability.

We do not know the names of the Kurgan nomads before 2200 BC, we know some names of these nomads from the Assyrian tablets ca 2200 BC: Guties, Turuks, Komans, Kangars; ca 1600 BC in China they are called Juns (Rongs) and Jous (Zhou); at approx. 800 BC in N.Pontic and Asia Minor they are called Cimmerians and Scythians; ca. 200 BC north of China they are called Huns, Juns, Tokhars (Yuezhi), Usuns, Saka, Kangars, and Tele; and in 200-400 AD they are called Huns in the west, in India, and across the eastern Eurasia, and Kangars and Usuns in the center of the Asia. After that, they continue to rush around Eurasia and build kurgans for their deceased for another 1,000 years, coming into the present.

Contrary to the evidence and testimonies, Germanic peoples are held as autochthons of the Scandinavia. Jordanes (6th c., Getica) introduced an opinion that most of the European peoples originated in Scandinavia. At the same time, Jordanes knew of the Goths' “Scythian” origin: “the Goths dwelt in the land of Scythia near Lake Maeotis. On the second migration they went to Moesia, Thrace and Dacia, and after their third they dwelt again in Scythia, above the Sea of Pontus” (Jordanes V.38). The Gothic origins are relayed in the initial legendary part, the non-fictional part of the Jordanes' work starts in the 3rd c. AD with Roman clash with the Goths.

Toponymic research demonstrated a paucity of native German place names in Germany, corroborating that Germans came to Germany in the last centuries BC (Chemodanov, 1962, p. 79). The south Scandinavia, reputedly the original homeland of the Germans, definitely could not populate the whole of the Europe. It has a gap between the cultures of the Bronze and Iron Ages (Mongait, 1974, p. 324), when sometime during the Sub-Atlantic climatic phase (cold wave, 5th c. BC - 50 BC) it was completely deserted, and then re-populated with a reverse flow. Evacuation of the farming population rolled out within a period of less than a century. Then enter the Classical writers, who found Germanic tribes in Gaul and Germany. In the first centuries AD the names Suebi and Germans had an equal currency. Apparently, both the names Germani and Suebi are exonyms, the first means “manly, brave” in Türkic, the second means “vagabond, vandal, wendeln (Wanderers)” in Celtic. The Germanic form wendeln is a calque of the Celtic word. The names Suebi and Germani are umbrella terms like the names Scythians, Saka, Sarmats, Alans, Ases, Goths, Huns, Türks, or Tatars, and like the names Celts, Greeks, and Romans. They refer to horse nomadic pastoral societies, mobile and perilous for sedentary farming peoples. From around 58 BC on, the fragmentary information on Germans becomes fairly continuous. Strabo (1st c. BC) described the category of the Suebi nomads as peoples living off their flocks on meat and milk, with “small huts” yurts and wagons, driving their herds from pasture to pasture, and opposite to the settled farming. The description of Suebi is indistinguishable from the description of the Scythians and their variety. As a number of distinct tribes with their own distinct names, the Suebi tribes occupied more than a half of Germania. The nomads and local farmers coexisted, occupying different interspersed niches. The nomadic part of Europe must have been very sparsely populated, because the pastoral economy is highly manpower-efficient but needs huge pastoral tracts to sustain large herds. East of Germania was the endless European Sarmatia, populated by the Sarmat Scythians, and nobody knew where one ends and another starts. A number of Suebi tribes have transparent Türkic names, Cf. Goths that stands for generic Guz (and gur) “tribe”, Alamanni that stands for generic alaman “raid” (Vorontsov, 2009, p. 65), Hercyni (as in Hercynian forest) that stands for generic “nomad” lit. “kindred (hence the appellative Hun) people”, As that stands for “plain (people)”, its Slavic calque is pole “field”, Polyan “field-man”.

The Normans' genealogical legends also recall that they “came from Asia”, they lived there in an ever flourishing country, richer than the cold Scandinavian coasts. The description of the “old country” echoes that of the Cockney, in Türkic köken “motherland, native place, ancestral land”, a “mythical luxurious country”. The Old Norse Prose Edda provides numerous parallels with the Türkic mythology, etiology, and ethnology. It links the Norman-Germanic genealogy with the god-ancestor Thor (Þor, Þorr), the Tengri of Tengriism and the Chinese Tian “heaven”. Following a prophecy, a legendary leader Odin led a “multitude of Ases” to the country of Saxes, and then on to other regions of the Germany. The Türkic term yazï “steppe, plain, flatland” is almost synonymous with the word alan “plain, flatland”, the names As and Alan were used interchangeably. The confusing parallelism of these two terms in the historical records and in Türkic attests to their common source, it is a salient case of paradigmatic transfer. Under the leadership of Odin, Ases triumphed everywhere over the local population, and in the end “the tongue of these Asiamen was the true tongue over all these lands;... and (that) the Asa brought the tongue hither into the north country; into Norway and into SviÞiod, into Denmark and into Saxland; but in England there are old names of the land and towns, which one may skill to know that they have been given in another tongue than this.” (Snorri Sturluson (ca 1220), 1842, p. 111). Of the nomadic Scythians, Strabo singled out the “best known of the nomads are those who took away Bactriana from the Greeks,... Asii or Asians, Tochari,... who originally came from the country on the other side of the Iaxartes River (Syrdarya) that adjoins that of the Sacae and the Sogdiani and was occupied by the Sacae.” The sources trace Ases and Saxes (Saka, Sacae) from the 140 BC Horezm to Bactria, to the N. Pontic Scythians, and to the vicinities of the Baltic Sea. Before that, in the 3rd c. BC, the Asii-Yuezhi controlled Ordos. The migrants had to carry not only their spoken language, but also their written language, the traditional titles, social and state organization, and a myriad of other cultural and societal traits. The Germanic Normans have a glorious obfuscated pedigree. The Normanic Vikings used to intimidate their neighbors a millennium before they turned to consummate the sitting ducks across Europe.

Snippets of the nomadic traces are scattered in the lexicon, morphology, semantic links, distinct phonetics, and regular phonetic shifts throughout the body of the examined English Turkisms. The body of knowledge inherited from the early days of the Germanic people had to suffer from a cognitive dissonance on the institutional level, when the uncivilized nomadic ancestry was first minimized in favor of the civilized agricultural traditions, then in favor of the civilized religion, then in favor of the civilized wars and colonization, and finally in favor of the civilized parochial patriotism. Linguistics is but one facet of the discrepant humanities; not only matters like earth/erde “earth” is unrelated to the Türkic yer “earth”, but the Germanic runes are unrelated to the Türkic runes, the European jurisprudence is unrelated to the nomadic jurisprudence, the European parliamentarism is unrelated to the nomadic parliamentarism, and so on in the department of new mythology of the European humanities. No applied science discipline can be institutionalized with one blinder on, rather, a minutest discrepancy is mercilessly massaged and studied till it is resolved, exhaustively tested, and applied for practical needs. No applied science discipline can exist without crisp definitions of its basics and effects. No applied science discipline can exist with two sets of contradictory textbooks. Humanities, in contrast, can have three Saxonias, one in Horezm bordering on the Asii, the other the Sakacena in Azerbaijan in the Caucasus, another one in the center of the Europe, and any contiguity is in a state of denial. Or have the two Asii folks, one in Bactria and Horezm, the other by the Baltic, and no contiguity in language or script. Science searches for quark entanglement, the humanities in contrast, in conflict with all counterindicators, postulate a disentangled egression. Instead of amalgamation, is firmly postulated an internal development.

A Zoroastrian source (speculatively dated by 1200 BC, on the events of the 7th c. BC, if that makes any sence) recites the As-related events: “Some Turanians, expelled from the Media country by (the legendary) Kai Khosrow, settled in the northwest karshvar (province) of the known inhabited earth. They crossed the river Erehsh (Arax) and Kurush (Kura), and lived on both sides of the mountain Erezur (Caucasian Elbrus) of the Hara Berezati ridge. These Turanians were also called by the name Ihkuzu (As Scythians). Later, they moved into the vast plain north of the ridge High Hara to the rivers Ranh and Danush (Itil and Don)... The Saka living by Danush (Don) were also called Danavas Turs (Danavo Turs) because in their country flows the river Danush, the longest and most dangerous of the rivers” (Zarathustra, 2002, p. 29). The name Scandinavia may be an echo of the name Saka-Danavo.

In Turkic languages tur/tura means “live, dwell”, “stopover, layover”, “settlement, dwelling”, toručï “inhabitants, population”. The names Tur and Turan are popular with Avesta, in the Iranian languages the names are not etymologized, etymologized, they attest to the Türkic linguistic influence on the Iranian languages of the time. As a specific toponym, Turan persevered for 3 millenniums, predating both Persia and the Han China.

Although the Old Norse sagas clearly depict Odin as a leader of the Asparukh type, who at a time of trouble leads his people to safety, the European historiography presents Odin as a head of a religious pantheon. There is no need to confuse the issue, since the Almighty is Thor. Attempts to re-christen Odin as a supreme being were not successful, under any dressing Odin could not replace the Almighty Thor. The mythological Odin may have been treated as a deified figure, but he remained a figure of flesh and blood, an eponymic tribal ancestor of the Nordic royalty. His name appears to be a title-name, like Attila (Father of Country) and Ataturk (Father of Türks), consistent with the pre-Islamic, pre-Christian Türkic naming tradition. The god-like qualities of Odin are consistent with the properties of the boyar class, who were held in reverence for their foresight, wisdom, knowledge of omens, foreboding, and even magic powers. The prosthetic initial w- in his name is of the same nature as the prosthetic f- in “father”, an allophone of the Türkic ata “father” (Gothic atta) with a pl. suffix to express a notion of respect. The title is a straightforward epithet, an allophone of the ata-ani “ancestor” (lit. “father-mother”), with some dialectal peculiarities. In Slavic languages, the title Odin has a form otchina/votchina (îò÷èíà/âîò÷èíà), from otets (îòåö) “father”, an allophone of ata that also has a variant with a prosthetic initial v-, specific for Slavic languages. It goes without a saying that a mortal, even if deified, can't head a religious pantheon. However, as a deceased fatherly figure, he can provide some occult protection, along the line of the dependable Catholic saints and the time-tested Roman/Greek gods. The name Odin is another relict of the Asian ancestry.

Huns play a major role in the German genealogical legends, an especial place occupies Etzel Attila (Nibelungenlied). The western Hunnic state operated on the model of the prior and later Türkic states, as a confederation of kindred tribes and their sedentary dependents. Attila inherited a state with a long-ingrained notion of the Türkic-Germanic kinship. The blurred footprint prevented Classical authors from delineating where the European Sarmatia ends and the Germania starts. The tribal composition of the Attila's forces at the Chalons (Catalaun) battle and the rise of the Germanic states after the split of the Western Hunnic Empire present a vivid attestation of the confederation membership. In a genetic bird's-eye view, the Roman-Hunnic confrontation and the succeeding melees were a conflict of the R1 haplogroup (R1a and R1b varieties) with the native European haplogroups, a replay of the millenniums-old confrontations during the previous Kurgan waves, including the last Sarmat wave of the 2nd c. BC. Huns left a sizable trace in the European culture and social organization, it took a millennium to suppress the last vestiges of the Türkic religion (in the forms of Arianism, Bogomilism, and other monikers), but the will for freedoms and the social canons have survived as the modern canons. Except for the eastern fringes, the European civilized slavery system did not survive the Hunnic period. Unlike the sedentary civilized societies, barbarians practiced confederations, not enslavement. The individual freedoms and parliamentary system are the vestiges of the steppe societal traditions.


To establish in a systematic fashion a conjectural genetical link between English and Türkic, suggested by history, archeology, ethnology, and biology, comparisons must be made between all linguistic aspects, namely typology, syntax, morphology, lexis, phonetics, distribution, script, and statistical significance. All these aspects must corroborate, or at least not contradict, the proposed concept. The historical background, and the parent and kindred languages systemically infused with Türkic elements, must be a canvas over which the linguistic aspects are drawn.

The discipline that pursues in that direction is called comparative historical linguistics, and that term should be used for the task of identifying Türkic-type layer in English. However, due to self-imposed preconceptions and limitations, that task can be called comparative historical linguistics only with major caveats. From its inception, the discipline of comparative historical linguistics emerged and was defined as a linguistic analysis of the Family Tree model, it is a “normal”, “accepted”, or “established, traditional” discipline; deviations from the “norm” are held abnormal and peripheral. Whatever linguistic classification scheme is selected, it is the Family Tree model, to the deliberate exclusion of other models, like the Wave model, by the very essence of the “normal” comparative procedure. Atomistic in method, the “normal” comparative historical linguistics behaved as an interpretive discipline-isolate, short-circuited on itself, and opposed to a holistic approach.

Historically, ethnic depictions described typical appearances of the alien peoples. In the “them” vs. “us” descriptions, the part “us” is omitted as well known to the reader. For the later students that part was understood as implied opposite: “they” are swarthy, then “us” are “white”, “they” are nomadic or sea-fairing, then “us” are stationary, “they” are blonds, then “us” are brunettes, etc. For centuries. the notion of race underlain and tainted all aspects of ethnic studies, including linguistics. The understanding of the origin and propagation paths of the typological traits is a newer development. The Indo-Arian “blond beast” turned out to be a trait alien to the Mediterranean “us”, it is carried by the genes originated in the northern Eurasians, the Finns and their genetic kins, the carriers and descendants of the Y-DNA haplogroup NO with that unique mutation. In the age of genetics, not once a discovery of light-haired dead was still hailed as an evidence of linguistic Indo-Europeanism. Similar misconceptions in respect to Mongoloidness and other acquired appearances covertly underlain the fundamentals of the comparative linguistics.

In reality, very few societies and their languages fall under the essence of such “comparative” procedure. A test of such peculiar comparative historical linguistics built on the Family Tree model, applied to any language in Europe from the Noah-equivalent point for the historical period, with terminal points from when it was documented for the first time to the modern language, would fail to reconstruct the modern language from the initial one, and vice-versa. That failure would be predicated by the deficiency of the model that neglects the most important factors in development of any European language, those of the Sprachbund effect, demographic effects of amalgamation and migration, demic cultural impact, and impact of mass media that started with introduction of mass print and spread of literacy. In pursuit of the proposed concept, no developmental model should be presumed, it should be a conclusion of the exploration, not its premise or precondition, nor a target nor a guiding star.

A similar position should be taken toward all other linguistic aspects, directed toward examination of the factual material without pre-imposed targets and limitations. Personal beliefs on what is possible and impossible must be dispensed with beforehand. Elementary justice demands a search for, introduction, and examination of every relevant evidence and every testimony for every etymological investigation, with unmitigated attention to counterevidence and wanton presumptions. Instead of being dispensed with as malleable variables to justify diversity within the IE family conglomerate, the subjects of typology and syntax, completely ignored by the “normal” comparative procedure, need to take their proper place as the leading and most conservative properties. The statistical significance stepdaughter of linguistics, which the “normal” comparative procedure knows only from citations of few selected luminaries, need to be a common and productive tool.

In the lexical aspect, opinions on the statistical significance for genetic connection vary from 150 cognates (G.Doerfer 1981) or every 70th word within a presumed basic 10,000-word dictionary, to a probability of 1:100,000 for “individual-identifying evidence” (J. Nichols 1996). That range of “opinions” on primary definition, from 70 to 1 individual, is ridiculous by any standards. For this examination, the G.Doerfer's suggestion, generally deemed to reflect a sound skeptical conservatism, is accepted as a vaguely defined threshold of credibility, as a lexical-statistical point which satisfies a minimal requirement for genetic connection. The lexical paradigm, nearly exclusively used in the comparative linguistics, is a low-hanging fruit. Other traits, like the paradigmatic transfer of the entire cooking lexicon, or the transfer of embedded morphology, are statistically immeasurably more weighty, each additional element exponentially reduces a possibility of falsification, quickly reaching a point of no return.

Under “normal” comparative procedure, geographic distribution rarely gets a cursory look, and when it does, it mechanically expands the depth of the Family Tree model to another level of a mature “pra-language” seed of the Nostratic theory. For this examination, the preconceived Family Tree model is dispensed with, and the term Nostratic is used for putative Eurasian Sprachbund elements connected with migratory flows prior to the domestication of animals as a means of transport. That invention cardinally changed the patterns of cultural exchange, replacing a slow-moving omnidirectional dispersion to a directional fast-pace dispersion.

The concept of genetic connection within linguistic family stipulates a base vocabulary shared by the members of the family. In essence, that is a standing non-codified definition of a linguistic family. Numerically, the base vocabulary is miniscule, numbering 10 to 20 dozen words selected for their stability across all candidate languages, it is a hard linguistic nucleus. In a conceptual 10,000-word dictionary, 200 words constitute 2% of the lexis. The remaining 98% of the lexis constitutes a shell enveloping the base vocabulary, it is malleable, it is not limited by any qualifications. The qualitative division onto the base and shell vocabularies was established by the “normal” comparative linguistics, it was a necessary theoretical accommodation of the fact that 98% of the Indo-European lexis is incompatible between the putative members of the Indo-European family. It also made “guests” an ordinary and predominating phenomenon, allowing a single “guest” to systematically infiltrate numerous languages from numerous linguistic families and their numerous branches. The conditional accommodation freed a 98% of any member's lexis from concordance with any linguistic laws, since the “guests” originated from any walks of life and joined via all kinds of paths.

The freedom from the laws allowed a freedom of typology, it brought, at least theoretically, under the IE umbrella such incompatible languages as Celtic, Armenian, Kartvellian, Ossete, and Kashmir. The conditional accommodation allowed to stipulate an attrition rate for the base vocabulary, at least for the tested European samples. Applying the formulated concept of genetic connection to the families other than IE flashed out an utter parochialism of the concept. In case of Türkic languages, and generally of typologically agglutinative languages, the core vocabulary numbered way in excess of 50% of the lexis, and the stability of the root as a key element was an order of magnitude higher than that for the IE languages. As a systemic diagnostic tool for genetic connection, the concept of the core vocabulary established for the European languages plus Sanskrit and Persian can't be extended across linguistic families other than the IE. For the non-IE substrate languages, the substrate concept must examine cases from the 98% shell, not stay shy from examining cases from the 2% base vocabulary, shrink down to individual words, their ultimate origin, their membership in the paradigmatic transfers, and to attempt to visualize historical paths between disparate linguistic families.

The method of the “normal” comparative procedure assembles a listing of the study's target words in allowed language (say, Lao) suspected to be cognates of the word list with known affiliation (say, Sino-Tibetan); the two listings are juxtaposed, compared, and analyzed in phonetic and vaguely defined semantic aspects, with episodic excurses into known political history as a dating tool. By definition, the method is not holistic, it leaves a bulk of the linguistic properties outside of the purview. At the same time the comparative procedure declines to give a credence to the analogous lexemes from unfavorable candidates. The two-faced-Janus approach proved to be beneficial for the sanity and racial purity of the IE comparative linguistics, imposing few limitations beyond the “normal” scope of purview, but at a price of introducing methodological flaw of circular logic, when the results filtered through the aperture of premises invariably validate these premises. With the circular logic, like in love, beauty is in the eyes of the beholder.

As a linguistic dating tool, the known political history may be a convenient, but profoundly unrealistic approach. The multi-ethnic composition of all Türkic confederations is not an exception, but rather a rule: during historical period, few political entities, even isolates, were monolingual, if only because the mammalian trait of sexual procreation extends way back beyond our historical period, and staunchly continues into our present. Way before the appearance of the human ethnicities, a drive for trophy husbands and wives was crossing any boundaries within the same species, the Neanderthalian admixture within the modern population saliently attests to that. Confusing political, ethnic, and linguistic categories is akin to a scholastic ghettoization of climate, you can do it, but dynamic events disagree with a static model. A fixated search for a ghettoized Urheimat instead of a linguistically fuzzy amalgamated and amalgamating Sprachbund in a time and space motion requires a holistic approach. That, in turn, would re-write definitions for the objective and the target.

The IE linguists routinely stipulate as a justification for the bifurcated methodology that the listings from uncouth suspects can't be taken seriously, asserting that unlike their sibling IE listings they are methodologically flawed, inconsistent with the given postulates, and prove absolutely nothing. The main tool of that approach is not to methodically refute alternate concepts, but to plant a doubt. At the same time, the miniscule qualifying requirements and the absence of concise definition allows essentially unlimited possibility to claim IE genetical connection for languages of entirely different origin, without any consideration for the relationship of the primary vs. the secondary, fluid, and possibly borrowed, properties. For this examination, the bifurcated methodology is dispensed with, the same criteria are applied without prejudice and any preconceptions. The body of cognates in the candidate language is viewed as a paradigm with its own structure, segmentation, and logics; the paradigm has to be consistent with advances accumulated by all disciplines including linguistics.

The “normal” comparative procedure does not trace evolution of the morphological elements, not in a small degree because no elements fit the entire IE Family Tree model, they belong to languages with incompatible typology and incompatible syntax, and offer commonalities only within certain Sprachbund areas. Within and without IE linguistics, the primacy of the language typology and syntax are established facts, phonetic modifications are effects and not causes of the linguistic changes. Consistent application of that cause and effect sequence creates insurmountable conflicts for the IE family concept; advancing phonetics into the cause position allows to save the concept by postulating, selectively for the IE family concept, malleable typology, syntax, and lexis. Morphology is not a building block of the IE Theory, it is its greatest Achilles' heel in need of secure disposition.

The proposed concept, in contrast, treats morphology as a most powerful diagnostic trait, it traces numerous European morphological elements to the Türkic agglutinative morphology, and demonstrates a genetic connection and adaptation to the typologically incompatible European languages. The treatment of morphology is not selectively fitted to any model, with understanding that every individual language formed from inherited, amalgamated, and innovated components, and a part of dialects of any linguistic grouping may carry traces of an archaic phenomenon shared with other dialectal groups, while any member of a dialectal group may replace archaisms with borrowings and innovations. In most cases, the English Turkisms allow to connect an English word with its real primeval root, still active, productive, and surrounded by a slew of morphologically derived offsprings, all ascending to the primeval notion of their living progenitor. The diagnostic tool of morphology is applicable across the entire language, without a need to target examination to particular grammatical classes.

The treatment of lexis within the proposed concept is based on the monumental work performed by IE linguists in the course of the past 200 years. The cognate listings cited in IE etymological compilations in most cases are used verbatim, extending beyond the IE horizon where applicable. One major difference is the accent on semantics instead of phonetics, the corollary of that difference is differentiation between homophones and allophones. Where the IE linguistics accentuates phonetics and treats semantics as a malleable property, a close attention to semantics allows to discard spurious and artificial parallels, cleansing analysis of notional biases aimed at a certain result. The other difference is an open horizon, a corollary of untargeted analysis, it expands linguistic geography to the places where the foot of the IE linguistics did not trod, and brings up systemic results outside of the “normal” purview. Another difference is the independence of the cognate candidates from the much berated IE linguistic laws. That is justified by the absence of linguistic laws that systematize malleability of Turkisms obtained in the processes of population replacement, amalgamation and cultural borrowing, and keeps in the parallels usually discarded by “normal” linguists who apply IE linguistic laws to stochastic phenomena in order to keep the stochastic process tamed and targeted. Another major difference is a healthy skepticism toward reconstructions fancied from applications of dubious laws to stochastic effects. Since the IE linguistic laws are no more than formulated observed tendencies, they carry a statically defined probability, and can be applied only in statistical context.

Conventional scripts that existed prior to codification, brought about by the introduction of printing, are conservative and traceable to their primeval forms. Scripts, however, are a separate branch of science neither endorsed nor used in comparative historical linguistics, to the detriment of the last. The independent life of graphemes is as much a historical evidence as are the spoken and written forms of phonetics. The proposed concept embraces the evidence, conveyed by the comparative historical orthography both in graphic and phonetic aspects, within the limits of immediate perception.

The statistical significance denotes a numerically expressed value of a set of data within a body of data, applied to measured parameters (statistics) and predicted statistical outcome (probability). The most informative statistical measurements in the proposed concept are the frequency of use, which is statistics on relative standing of a particular word in a language, and probability of chance coincidences. The frequency of use statistics allows to appraise numerically the place of the substrate language in the examined language, a most significant parameter of the examined language. The factor of probability of chance coincidences, on one hand, allows to validate or reject a test subject based on the valuation of probably vs. improbable, and on the other hand allows to validate or reject linguistic assertions made from the perceptions of personal intuition and ingrained biases.

Some statistical reality needs to be accounted for in assessing formulaic interpretations “fortuitous, echoic, onomatopoetic”, and similar dead-end etymologies. The limitations of the source base are not uniform, they vary greatly depending on subject, e.g. well documented religious lingo vs. poorly documented daily utilitarian lexis. That shortcoming is especially great in the paramount areas of the nomadic economy, animal husbandry and the metal and weaponry production. At times, barely 10% of the specialty lexicon recorded in the source base matches that documented by special studies, Cf. A. Shcherbak, 1961. The statistical chances of locating a relevant cognate for the niche semantics is really fortuitous, reflecting the ratio of the live lexis vs. the recorded compendium. Since the lexical distribution paths are totally unknown or at best tentative, an omission of a single pertinent cognate is critical in each individual case, denying an opportunity to locate a possible demographic path, and generally to establish possible genetic connection. Under such conditions, assertions on the fortuitous, echoic, etc. formulaic origins can be valid only when they are based on exhaustive study of a wide range of synonyms extended as far as the real and potential migratory trails.

The historicity of the comparative historical linguistics is an unsettled issue with diametrically opposing opinions, from purists arguing that comparative historical linguistics is unable to penetrate or reflect historical processes to practitioners associating linguistics with specific political events. A holistic assessment of the lexical material in the context of the proposed concept allows to link linguistics with history and the disciplines as diverse as biology, economy, ethnology, and technology, and to use the mutually corroborating evidence to affirm the proposed concept.

The trait of paradigmacity is a most powerful tool attesting to the genetic connection, it is one of the tenets of the IE comparative linguistics. The observed transfer of paradigms, traceable from the Türkic to Germanic and on to the English corroborate the proposed concept. Generally, members of paradigmatic transfer share common traits: their IE etymology bravely declares them to be “of unknown origin” or spottily suggests easily confuted mostly incredible etymology; in paradigmatic transfer cases cognates are endowed with their separate mutually unrelated “PIE proto-word” origins; the cases of paradigmatic transfer slip undetected and unaddressed; morphology-related paradigmatic transfer is ignored.

The trait of paradigmacity is a most powerful tool attesting to the genetic connection, it is one of the tenets of the IE comparative linguistics. The observed transfer of paradigms, traceable from the Türkic to Germanic and on to the English corroborate the proposed concept. Generally, members of paradigmatic transfer share common traits: their IE etymology bravely declares them to be “of unknown origin” or spottily suggests easily confuted mostly incredible etymology; in paradigmatic transfer cases cognates are endowed with their separate mutually unrelated “PIE proto-word” origins; the cases of paradigmatic transfer slip undetected and unaddressed; morphology-related paradigmatic transfer is ignored.

The reason for such inept treatment is that the bulk of the accumulated etymological studies is a century old, from the time of horse carriages and inchoative nation-states with high attention to creation their historical narratives. Nearly all studies were produced in Europe, by the European scholars, in accordance with the European norms of the day, and for the European consumption and mentality; exemplary pedantry was held in high esteem, it did not need an open mindset, abhorred scientific heresy, and had powerful tools to curb deviance. Lexical similarity and cognates were not only a base of the science, they were the science. Still, other than a general concept, the tool of the paradigmatic transfer is in its incipiency, a theoretical tenet in waiting for practical implementation. In support of the proposed concept, the method used is duly concerned with cases of paradigmatic transfer in their many configurations, and seeks to detect each case as the most credible evidence.

While the methodology used in support of the proposed concept in many respects conflicts with the tenets of the “normal” comparative historical linguistics, the method is productive in its approaches, in the spectrum of examined factual material, analysis, the width of corroborating evidence, and achievement of credulity for the proposed concept.

Given that English is a considerably modified version of Anglo-Saxon, the most expedient approach would be a direct comparison of Anglo-Saxon versus Türkic, a task beneficial for the present examination and fruitful for the IE linguistics. That such comparison, in spite of a plethora of documented indicators, has never been conducted, is a monumental methodological flaw. The expediency of the Anglo-Saxon – Türkic comparison would however miss English Turkisms absorbed via other languages that formed the English substrate, particularly Frisian, Celtic, and Gothic components. Numerically limited, such components are invaluable testimony to the richness and complexity of the English linguistic history; to deliberately abandon them would distort the evolution of a living language. The method used in support of the proposed concept, in contrast, attends to the substrate Turkisms irrespective of their source.


The English substrate words of Türkic origin are scattered in numerous works of mostly non-linguistic nature, where they are accidental to the topic of the work. A few of them are picked up from the linguistic work of G.Shuke 2010, who compares substrate lexicons of the Latvian and Russian vs. the Türkic, and unwittingly cites numerous English cognates in the Latvian and Russian Türkic-based substrates. Numerous Christian terms of the Türkic origin are analyzed in the work of Yu.N. Drozdov, 2011. The Old Türkic Dictionary (OTD, 1969), though extracted from the eastern Türkic languages, includes numerous words of the Türkic-English substrate lexicon, and hints at more words, of which only derivatives or some particular forms were recorded, leaving the stems to be produced by truncating their word-forming suffixes. G.Clauson Etymological Dictionary of pre-13th c. Turkish (EDT, 1972), though also extracted from the eastern Türkic languages, includes most of the cognates listed in this compendium, frequently indirectly and directly translated with their English cognates. A large number of Turkisms in Russian language covered in the work of E. Shipova Dictionary of Turkisms in Russian Language have matching English cognates identified in etymological dictionaries of English lexicon as “of unknown origin” or otherwise listed with partial or dubious etymology, frequently bent to Indo-European side. There is a tendency to use semantically unrelated Skt. examples as real cognates, just to make a point. In contrast with the flexive IE, the Türkic stems are not readily changeable. Generally, the romantic IE unattested *reconstructions of the English substrate lexicon are leaving a bad taste, confirming the old adage that Indo-European reconstruction inevitably has a character of a scientific fiction; not too numerous IE *stem conjectures are intentionally left out as confusing noise.

The listings of Turkisms in this compendium is intentionally limited to a number of 800, to prevent the task from being open-ended. A potential for increase is obvious even within the deliberately restricted scope, many words have semantically distinct distinct derivative and allophonic versions, like the pair dip and deep, each with a trail of its own allophones and derivatives. Dozens of words are mentioned here as examples without elucidation, dozens more not mentioned here are obvious cognates. Increased inventory adds little to advance the concept. An expansion of the list has a trend of diminishing returns: although the words are innately English, the frequency of their use tends to decrease, with a net result that adding quantities does not materially increase the degree of the role Turkisms play in the modern English.

The whole body of the documented Türkic words in English can comfortably stand on its own without any non-attested stem forms. To illustrate this point, here is a list of words that are practically the same in the two languages, they only differ by divergent conventional Romanization.

Sample list of cognate Türkic–English words

These English words can be substituted in casual speech with their properly suffixed Türkic counterparts, and nobody would hardly notice (like kel- > kelter vs. kilter, qatïɣ > qatïɣir vs. category, kast- > kastis vs chastise);
just the opposite, replace them with some colloquial English or European versions, and they would stand out: compare with European (Fr. (de)calage, categorie, chatierai, Gmn. Gleich, Kategorie, kasteien respectively), or with the barely recognizable 10th c. Old English forms hlaford “lord”, brohte “brought”, scolde “should”, licode “liked”, and the like. In many cases, Türkic counterparts look like dialectal forms from the other side of the river, a type of the Cockney version.
In many cases modern spelling obscures phonetic connection (like ög-, A.-Sax. ege, eage, aga, modern eye); some regular alterations, like b/f, also obscure phonetic connection (like bur “fire”, A.-Sax. fyr); such cases are not included in this selection of easily recognizable samples.
The sample list of readily recognizable words (about 450) is pulled from the dictionaries transcribing the Middle Age Türkic lexicon in Central Asia. It represents about 55% of the 800 lexemes cited in this compendium.

  English Türkic English Türkic English Türkic English Türkic English Türkic
1 abode oba can kanata dumb (adj.) dumur king kengu sing (v.) siŋ- (v.)
2 abundant (adj.) abadan (adj.) can (v.) qan (v.) dune dun lamb -la:n sink (v.) siŋ- (v.)
3 abysm abamu candle kandil durable dür- (v.) loaf lavāš sip (v.) syp (v.)
4 ache àčï cap kap duration dür- (v.) mallet maltu sling salïŋu
5 acid (n., adj.) àčï- (v.) car köl- (v.) duress dür- (v.) mama mamü skull kelle
6 act (v.) aqtar- (v.) caragana qaraqan dust doz mammal meme smile (v., n.) semeye (v.)
7 Adam adam care qorq dye dawa man men soak (v.) saɣ- (v.)
8 again aga (adj.) carnival kerme earl yarlïqa- (v.) many munča (adv.) sock (beating) sok- (v.)
9 age aga case kečä Earth Yer mantra maŋra- (v.) sock (legging) sok- (v.)
10 ago (adj., adv.) aga (adj.) case qaza eat (v.) ye- (v.) marasmus maraz socket sok- (v.)
11 aggrieve aɣrï cash kečä eave ev mare ma: sodden (adj.)) sod
12 alimentation alım cast (v.) kus- (v.) eke eken (v.) matt (adj.) mat (adj.) son song
13 alimony alım cast (v., n.) qïsdï (v.) elbow el me (pron.) min (pron.) sorrel (adj.) sary (adj.)
14 all (n., adj.) alqu (n., adj.) castigate (v.) kast- (v.) eligible (adj.) elïg- (v., n.) means min squeeze (v.) qis- [qys-]
15 Alban àlban (n., adj.) category qatïɣ elite elit- (v.) mengir meŋgü stay (v.) üstü- (v.)
16 alms almak cattle katıl elk elik mental (adj.) meŋtä (adj.) subliminal (adj.) sumlîm (adj.)
17 amen (adj.) ämin (adj.) cause köze:- ell el menu meŋ suck (v.) saɣ- (v.)
18 analogue anlayu (adv.) cavalry keväl elm ilm mickle mig sundry (adj.) sandrı:- (v.)
19 apt apt cave kovı: en- en- mind ming sure (adj.) sürek (adj.)
20 arch arca cavern kovı: endure endür- (v.) mint (v., n.) manat surrender (v.) süründi- (v.)
21 argue (v.) arqu- (v.) cavity kovı: enigma tanığma mist muz swell siwel
22 Arthur artur- (v.) Celt kel- (v.) -er er (morph.) monastery manastar tab tap- (v.)
23 as (adv.) aδïn (adv.) chalant (adj.) čalaŋt Erik erk money manat tablet tü:b
24 ashlar aslïq- chalk chol equal (adj.) egil (adj.) mother mamü taco toqüč
25 ass eš(äk) chastise (v.) kast- (v.) ether äsir mount (v.) mün- (v.) tad tat
26 assign (v.) asïɣ chat (v.) čat- (v.) evacuate (v.) evük- (v.) mouse muš tag toqu
27 astute (adj.) asurtɣuq (adj.) chattel čatïl evict (v.) evük- (v.) much munča (adv.) tack (v., n.) tak- (v.)
28 at (prep.) at- (v.) chatter (v., n.) čatu:r (v.) evil (adj., n.) uvul- murky (adj.) mürki (adj.) tale tili- (v., n.)
29 Augean aqür cheap (adj.) čıp (adj.) evoke (v.) evük- (v.) nose ñü:z talk (v., n.) tili- (v., n.)
30 aught ot (adj.) check chek ewe eve oat ot tall (adj.) tal
31 aurora yar- cheek čaak fare (v., n.) faqr(lïq) oath ötä- (v.) tally (v., n.) tili- (v., n.)
32 awe (v.) ö- (v.) cherub čebär feeling bilin- ocean ӧkän tambourine tambur
33 awhile (adv.) äwwäl (adv.) chill (v., n.) čil gabble (v.) gap- (v.) ogle (v.) ög- (v.) tap (v., n.) tap (v.)
34 baby bebi chip čïp gadding qad ok (interj.) ok (interj.) tap (v.) tap (v.)
35 bad (adj.) bäd (adj.) chisel (v.) čiz- (v.) gaffe ɣafillïq old (adj.) ol- (adj.) tar (v.) ter- (v.)
36 bag bag chitchat (v., n.) čit čat (v.) gain gänč omen aman (adj.) tariff tarïɣ
37 baize bez chop (v., n.) čop- (v.) gamut (adv.) qamit (adv.) on (prep.) on- (v.) tart (adj.) tarqa (n.)
38 bake (v.) baka:č (n.) circle sürkülä (v.) gaze (v.) giz- (v.) onus önüs (adj.) tell (v.) tili (v., n.)
39 bald bül (adj.) clinch (v.) qïlinč (v.) gene gen- (v.) orate (v.) orı: (n.) terrain ter- (v.)
40 bale (v., n.) bele- (v.) coach (v.) köch (v.) gift kiv- (v.) ore öre: theriacum tiryak
41 barge (n.) barq coagulate (v.) qoyul- (v.) gird (v.) qur- (v.) ortho- örti- thick sik
42 bark barq cock (latch) kök give kiv- (v.) other ötürü (adj.) thief tef
43 baron baryn cock (fowl) kök God kut otter ätär think (v.) saq-
44 bash (v.) baš (v.) cockney köken goose qaz ought ötä throne tören
45 bath (v.) bat (v.) colon kolon gore (v.) göres- (v.) owe (v.) oye- (v.) tick (v., n.) tiki
46 be (v.) buol- (v.) colossal qolusuz Gorgon qörq- ooze (v.) ӧz (v.) tick tik- (v.)
47 bear (v.) ber- (v.) con- (pref.) kon- grey (adj.) ğır (adj.) pat pata (v.) till (v.) til- (v.)
48 beetle bit coney, cony kuyan groom görüm(čï) penny peneg time timin (adv.)
49 Belgi (adj.) Belgü (adj.) cook kok- (v.) guest göster pot patır tire teyir
50 bellow (v.) belä- (v.) copious (adj.) köp (adj.) gut kut pour (v.) pür tit tiši:
51 belt bel corset qursa habitat oba push (v.) puš- (v.) to (prep.) tu- (v.)
52 berm bürma count köni hador (OE) xatär pussy (n., v.) päsi (n.) toe toy
53 bet (v., n.) büt- (v.) courage kür (adj.) hard (adj.) qart (adj.) quality qïlïɣ toilet tölet
54 big (adj., adv.) big court qur- (v.) hare horan queue toll tol
55 bill (v., n.) bil- (v.) cousin qazïn haze häzl quit (v.) ket- tomb tumlu
56 bill bilä- (v.) cove kovı: heap kip robe rop tooth tiš
57 bode (v.) bodi cow coy Helen ellen- (v.) sag (v.) sök- (v.) top töpü
58 bodega butïq coy (adj.) köy- (v.) herd kert saga savag- (v.) topple topul
59 body bod crime krmšuhn hilarious (adj.) güleryüz (adj.) sagacity sag torah tör
60 bog bog cue host göster sage sag toss (v.) toš- (v.)
61 bogus (adj.) bögüš (adj.) cull (v.) čul- (v.) I (arch. ic) ič (es) sail (v.) salla (v.) tuber tü:b
62 boil bula- (v.) culture kültür- (v.) ideal (adj.) edil (adj.) sale sal- (v.) truth dürüst
63 bong böŋ cup kap idyl (adj.) edil (adj.) sane san- (v.) tuck (v.) takın- (v.)
64 boot bot cure (fix, v.) kur- (v.) ilk ilk sanity san- (v.) turkey turuhtan
65 booze (v.) buz (v.) cure (food) kuri:- (v.) in (prep.) in (n.) sapphire sepahir turf ter- (v.)
66 bore (v.) bur- (v.) curd ko:r inch (n., v.) ınča sari sarïl (v.) turn (v.) tön (v.)
67 Boris böri curt (adj.) qïrt (adj.) inn (n.) i:n (n.) sash saču: twat tat
68 boss boš (adj.) curve qarvï (adj.) jack (v., adj.) cak- (v.) satisfy (v.) satsa- (v.) udder ud
69 botch (v.) bud- (v.) curse qur- (v.) jag čak(k) satyr satir ululate (v.) ulï- (v.)
70 boutique butïq curtain qur- (v.) jam jem savant savčï (v.) un- an- (morph.)
71 box boɣ dad dedä jar jart savvy savan (adj.) under (adv.) aŋıttır- (v.)
72 brother birader dam da:m jar (v.) jar- (v.) say (v.) söy- (v.) unite (v.) una- (v.)
73 bulge (v., n.) beleg (n.) dash (v., n.) taš- (v.) jeer (v.) jer- (v.) sea si until anta
74 bunch (v., n.) bunča (adv.) dash (n.) taš- (n.) jig (v.) jïq (v.) secede ses- (v.) us (pronoun) ös (pronoun)
75 bundle (v., n.) bandur- (v.) dawn tang (taŋ) jig (n.) jig secret soqru usher (v.) üšer (v.)
76 burl burnï deep dip jog (v.) jag (v.) sector chektür vacuum evük- (v.)
77 bursary bursaŋ delve (v.) del- (v.) jolly (adj.) yol sell (v.) sal- (v.) vat but
79 bust basta dementia dumur journey jorï (v.) sepia sepi- (v.) voucher vučuŋ
80 butt büt derrick terek juice seize (v.) sız- (v.) while äwwäl (adv.)
81 cab qab/qap descend (v.) düšen (v.) jut (v.) jalt (adj.) seizure sïzğur- (v.) whip yip
82 cabbage qabaq diadem didim keen (adj., v., n.) qïn- (v.) sever (v.) sevrä- (v.) wig yü:g
83 cackle (v.) kakla- (v.) dick dık- (v.) keep (v., n.) kap- sharp (adj.) šarp (adj.) will (v., n.) bil- (v.)
84 cadre kadaš dike dık- (v.) Kent keŋit- (v.) she (pron.) šu (shu) (pron.) yacht yaɣ- (v.)
85 caginess qïjïm dingdong daŋ doŋ kick (v., n.) kik- (v.) sheep sıp yell (v.) yel (v.)
86 cairn kayır dip dip kill (v.) öl- (v.) shilling sheleg yet (adv.) yet- (v.)
87 cake kek do tu- kilter kel- (v.) shit (v., n.) šıč- yield (n., v.) yılkı:
88 call qol doll döl kin kin/kun/kün shock (v., n.) šok- (v.) you (pron.) -üŋ (pron.)
89 calm (v., n., adj.) kam- (v.) don (v.) ton- (v., n.) kind (adj.) keŋ (adj.) short (adj.) qïrt (adj.) young yangi:
90 callous (adj., v.) qal (adj., v.)         sick (adj.) (ill) sık- (v) Yule yol
91             sin cin (jin) youth (n., adj.) yaš (adj.)
              sinew siŋir yuck (excl.) yek (n., adj.)
                  yummy (adj.) yemiš (adj.)

Of the approximately 450-word sample list of readily recognizable words, about 320 words are identical semantically and strikingly close phonetically. The main differences are few morphological endings (like -age in courage vs. kür-), fluctuation of vowels (like amen vs. ämin), voiced/unvoiced d/t, b/p, g/k, and the like. Some of these words are amazingly old, they display attested endurance and conservatism over record-grade timespans. The title earl vs. yarlïqa- ascends to the 21st c. BC, 4200 years ago, it means “councilman”, and refers to the title of the Gutian ruler of Akkad. The Augean vs. aqür ascends to the Greek legends of the 1st millennium BC, 3000 years ago, it literally means “horse stables”. The Adam vs. adam and ewe vs. eve ascend to the Hebrew Bible, they literally mean generic “man” and “female”, the Bible story ascends to the 1st millennium BC, 3000 years ago. A real record-setter could be cairn vs. kayır, the first cairns belong to the 6th millennium BC, 8000 years ago, they belong to the dawn of the Kurgan Cultures, cairn literally means “heap of stones”. Their Gaelic and Scottish forms could only come with the Celtic circum-Mediterranean migration, arriving to Europe in the 28th c. BC, 4800 years ago. The selection of enduring words (about 320) represents about 40% of the 800 lexemes cited in this compendium, it demonstrates an amazing lexical conservatism traceable over millenniums.

Sample list of identical Türkic–English words
  English Türkic English Türkic English Türkic English Türkic English Türkic
1 Adam adam caragana qaraqan eligible (adj.) elïg- (v., n.) mock (v.) -mak suck (v.) saɣ- (v.)
2 amen (adj.) ämin (adj.) Charlemagne Charla-mag elite elit- (v.) monastery manastar sundry (adj.) sandrı:- (v.)
3 -an (pl.) -an (morph.) chat (v.) čat- (v.) elk elik mouse muš sure (adj.) sürek (adj.)
4 apt apt chattel čatïl ell el much munča (adv.) surrender (v.) süründi- (v.)
5 arch arca chatter (v., n.) čatu:r (v.) elm ilm murky (adj.) mürki (adj.) swell siwel
6 archaic (adj.) arca cheap (adj.) čıp (adj.) en- en- oat ot tab tap- (v.)
7 are (v.) -ar (v., n.) check chek endure endür- OK (interj.) ok (interj.) tad tat
8 argue (v.) arqu- (v.) cheek čaak enigma tanığma omen aman (adj.) tag toqu
9 Arthur artur- (v.) chill (v., n.) čil -er er on (prep.) on- (v.) tack (v., n.) tak- (v.)
10 ass eš(äk) chip čïp Erik erk onus önüs (adj.) tale tili- (v., n.)
11 at (prep.) at- (v.) chitchat (v., n.) čit čat (v.) ewe eve orate (v.) orı: (n.) tall (adj.) tal
12 Augean aqür chop (v., n.) čop- (v.) gader qad ore öre: tally (v., n.) tili- (v., n.)
13 aught ot (adj.) circle sürkülä (v.) gamut (adv.) qamit (adv.) ortho- (adj. prefix) örti- (v.) tambourine tambur
14 awe (v.) ö- (v.) clinch (v.) qïlinč (v.) gaze (v.) giz- (v.) otter ätär tap (v., n.) tap (n.)
15 awhile (adv.) äwwäl (adv.) coach (v.) köch (v.) gene gen- (v.) ought ötä tap (v.) tap- (v.)
16 baby bebi coal kül/köl goose qaz owe (v.) oye- (v.) tar (v.) ter- (v.)
17 bad (adj.) bäd (adj.) cock (latch) kök gore (v.) göres- (v.) ox öküz tariff tarïɣ
18 bag bag cock (fawl) kök Gorgon qörq- ooze (v.) ӧz (v.) tell (v.) tili (v., n.)
19 baize bez cockney köken groom görüm(čï) pat pata (v.) terra ter- (v.)
20 bake (v.) baka:č (n.) colon kolon guard qur- (v.) pot patır theriacum tiryak
21 bale (v., n.) bele- (v.) con (v.) qun- (v.) guest göster pour (v.) pür thick sik
22 barque (n.) barq con- (pref.) kon- (n., adj.) gut kut push (v.) puš- (v.) thief tef
23 bark barq cook kok- (v.) haze häzl pussy (n., v.) päsi (n.) throne tören
24 baron baryn copious (adj.) köp (adj.) heap kip queue tick (v., n.) tiki
25 bash (v.) baš (v.) corset qursa Helen ellen- (v.) quit (v.) ket- tick tik- (v.)
26 bastard bas + tard count köni herd kert quite (emph.) ked (emph.) till (v.) til- (v.)
27 bath (v.) bat (v.) courage kür (adj.) hilarious (adj.) güleryüz (adj.) robe rop time timin (adv.)
28 bear (v.) ber- (v.) cousin qazïn host göster -s (pl.) -z (morph.) tire (n.) tire- (v.)
29 bear böri cove kovı: house koš/quš/xüžə -'s (poss.) -si (morph.) tit tiši:
30 beetle bit cow coy ideal (adj.) edil (adj.) sag (v.) sök- (v.) to (prep.) tu- (v.)
31 Belgi (adj.) Belgü (adj.) coy (adj.) köy- (v.) idyl (adj.) edil (adj.) sage sag toe toy
32 belt bel cue ilk ilk sail (v.) salla (v.) toilet tölet
33 berm bürma cull (v.) čul- (v.) in (prep.) in (n.) sale sal- (v.) toll tol
34 bet (v., n.) büt- (v.) culture kültür- (v.) inch (n., v.) ınča sane san- (v.) tooth tiš
35 big (adj., adv.) big cup kap inn (n.) i:n (n.) sapphire sepahir top töpü
36 bill (v., n.) bil- (v.) curb (n., v.) kır jack (v., adj.) cak- (v.) sari sarïl (v.) tor tärä
37 bill bilä- (v.) cure (fix, v.) kur- (v.) jag čak(k) sash saču: torah tör
38 bode (v.) bodi cure (food, v.) kuri:- (v.) jam jem     toss (v.) toš- (v.)
39 bodega butïq curt (adj.) qïrt (adj.) jar jart satyr satir tuber tü:b
40 body bod curve qarvï (adj.) jar (v.) jar- (v.) say (v.) söy- (v.) tuck (v.) takın- (v.)
41 bog bog dad, daddy dedä jeer (v.) jer- (v.) sea si turn (v.) tön (v.)
42 bogus (adj.) bögüš (adj.) dam da:m jig (v.) jïq (v.) secede ses- (v.) twat tat
43 bong böŋ dawn tang (taŋ) jig (n.) jig sector chektür un- an- (morph.)
44 boot bot deep dip jog (v.) jag (v.) sell (v.) sal- (v.) unite (v.) una- (v.)
45 booze (v.) buz (v.) derrick terek jolly (adj.) yol sepia sepi- (v.) until (Prep., Conj.) anta
46 bore (v.) bur- (v.) descend (v.) düšen (v.) journey jorï (v.) seize (v.) sız- (v.) us (pronoun) ös (pronoun)
47 Boris böri diadem didim juice seizure sïzğur- (v.) usher (v.) üšer (v.)
48 boss boš (adj.) dick dık- (v.) keen qïn- sever (v.) sevrä- (v.) voucher vučuŋ
49 boutique butïq dike dık- (v.) keep (v., n.) kap- sharp (adj.) šarp (adj.) wake vak
50 box boɣ dingdong daŋ doŋ Kent keŋit- (v.) she (pron.) šu (shu) was var- (v.)
51 brother birader dip dip kick (v., n.) kik- (v.) sheep sıp while äwwäl
52 bunch (v., n.) bunča do tu- kilter kel- (v.) shilling sheleg whip yip
53 bursary bursaŋ doll döl kin kin/kun/kün shock (v., n.) šok- (v.) wig yü:g
54 bust basta don (v.) ton- (v., n.) kind (adj.) keŋ (adj.) short (adj.) qïrt (adj.) yacht yaɣ- (v.)
55 butt büt dune dun king kengu shove (v.) sav- (v.) yah (interj.) yah (interj.)
56 cab qab/qap duress dür- (v.) loaf lavāš sick (ill) sık- (v) yeah yah (interj.)
57 cabbage qabaq dust doz lamb -la:n sick (vomit) sök- (v.) yell (v.) yel (v.)
58 cackle (v.) kakla- (v.) dye dawa make (v.) -mak sin cin (jin) yet (adv.) yet- (v.)
59 caginess qïjïm earl yarlïqa- (v.) mama mamü sing (v.) siŋ- (v.) you (pron.) -üŋ (pron.)
60 cairn kayır Earth Yer man men sink (v.) siŋ- (v.) young yangi:
61 cake kek eat (v.) ye- (v.) matt (adj.) mat (adj.) sip (v.) syp (v.) Yule yol
62 call qol eave ev mental (adj.) meŋtä (adj.) soak (v.) saɣ- (v.) youth (n., adj.) yaš (adj.)
63 can (v.) qan (v.) eke eken (v.) menu meŋ sock (beat) sok- (v.) yuck (excl.) yek (n., adj.)
64 candle kandil elbow el mint (v., n.) manat sock (cloth) sok- (v.) yummy (adj.) yemiš (adj.)
65 cap kap         son song    


The role of the Türkic substrate in English speech can be evaluated with the help of the word frequency listings, as for example are given in http://en.wiktionary.org/wiki/Wiktionary:Frequency_lists/TV/2006/1-1000, Rev. date 5 September 2013. The ongoing minor tweaking of the Wiktionary lists is inconsequential. The frequency count of the words comes from a collection of TV and movie scripts/transcripts of the actual words said by the speakers. The total word count in the collection at the time was 26,376,342, far in excess of the needs for a present rough assessment. Slight modifications were required to combine entries listed separately, like I, I'll, I'm, I've etc., and varying transcriptions like yes, yeah, ya, etc. Accounting for these peculiarities of the Wiktionary lists makes results more accurate by upping the partial values of frequency and rating for few most frequently used words, but affect little the conceptual valuation.

The usage frequency should not be confused with the composition of the lexis, which refers to the fractions of the entire lexis. The test was conducted in two steps, first on a subset of the common vocabulary containing about half of the examined vocabulary, and then repeated for the other half. The standard experimental method of independent testing examines random subsets of data, in computational linguistics it is applied for statistical validation of reliability and results. The method produces repeatable and predictable outcomes independent of the researcher. The experimental method arrived with conclusively close numbers.

Initial trial of the frequency test, using a 2000-word frequency list and 450-word list, found a match for 129 more frequently used words, producing a combined number of 26.9% frequency. Results of the trial indicated that allowing a conservative 3.1% frequency for the remaining 300 unmatched words, about 30% of the English daily lexicon is retained or is based on the lexical base that originated with the Türkic linguistic family. So few words constitute such a huge portion of the language because we have to use them continuously to keep English the English. Considering the volume of the texts sampled for the frequency listings, in this case 26,376,342 words of text, the results were fairly accurate, and would fall into that ballpark number at any similarly structured frequency test, even if some correspondences are disqualified.

A consequent frequency test, using the same 2000-word frequency list and a 800-word list, found a match for 217 words, producing a frequency value of 36.39%. The last, 2000th, word on the list carried a frequency of 0.0036%; allowing on average for each additional word to add half of that, or 0.0018%, out of the total 800 words the 585 remaining unmatched words would be expected to add a product of 585 words times factor 0.0018% or 1.05%. The total for the 800-word list would be approximated as 37.4%. The result of the expanded calculation agrees within reason with the result of the initial trial, confirming that about 1/3 of the modern English daily lexis originated with the Türkic linguistic family. For the validation of the Türkic substrate hypothesis, precise accuracy is pointless, the impressive result of approximately 1/3 of the lexis amply proves the concept.

Assuming that the 800-word listing of Turkisms covers 2/3 of the entire body of Turkisms in English, and that each Turkism has a trail of 5 cognates and derivatives, a ballpark estimate for the entire body would amount to 7200 words, or 1.5% of the English vocabulary of 500,000 words. The lexis' 1.5% produce 1/3 of the daily vocabulary. It does not take a strenuous effort to appreciate, with no exaggeration, the extent of the Türkic component’s presence in the daily English speech. The top ten most used cognates, the you, I, and, that, this, my, do, time, tell, and make, according to an independent study of 26 mln words of text, comprise a disproportional 16.6% of the non-specialized English usage.

The body of the Turkisms in English constitutes a massive case of multi-faceted paradigmatic transfer, it provides incontrovertible attestation of the Türkic admixture in English. For comparison, the Russian lexis reportedly consists of 25% of the Türkic admixture, in reality much more, with the 25% portion so far documented academically. The numbers in English and Russian are quite compatible, difference lays only in the degree of the admixture. Some native clusters consist exclusively of Turkisms, take the unrefined sexual lexicon: Eng. sex, Ru. sik(at), Tr. sik- (v.) “sex, copulate”, sik (n.) “penis”, Eng. cock, dick, Ru. huy, Tr. kök, küy “penis”, dık- (v.) “erect, stand straight”, and so on, extending to compounds, Cf. Eng. childbearing, Ru. ber(emen), Tr. ber- (v.) “carry”, ko:l (n.) “comrade”.

Considering that the western Türkic languages are severely underrepresented, with only a few chance citations by the Classical authors, that they do not have dictionaries or texts compatible in scope with that of the EDT and OTD, and that numerous old languages are classified provisionally or for various reasons are misclassified, it is very safe to assert that a significant portion of the western Türkic lexicon is not available for comparisons, with a considerable portion of the Türkic substrate lexicon remaining unexplored, and their English counterparts remaining either “of unknown origin” or dubiously etymologized.

A notable device of the common etymological practice is to present the “onomatopoeic” or “echoic” origin as etymology. In most cases such diagnosis is a true non-statement, since different people draw completely different onomatopoeic imitations, and to be valid an assertion of onomatopoeia must be specific to either a linguistic branch or at least to a linguistic family. Comparison of onomatopoeic forms from different sources makes guests from local languages and from different linguistic families readily visible. Diligently applied, onomatopoeia is a valid diagnostic tool on the philological origin, be it a peculiarity, a Sprachbund or a substrate rudiment.

The Türkic–English onomatopoeic correspondences demonstrate closer connections than their many counterparts supposedly grown from the same PIE root. The onomatopoeic roots of any lexicon ascend to the first linguistic experiments, when they were a part of a mimic display needed to convey an idea with inchoate vocabulary. Agglutinative languages allow to observe a gradual, step-by-step, formalization and build-up of the primitive onomatopoeic stems into developed lexemes that stand on their own after grammatical, phonetical, and metaphorical transformations.

(Needs update) Table 1a. Frequency listings for TürkicEnglish correspondences
Sorted by column Rank
Orthography is adjusted for phonetical clarity; the ɣ, ŋ, and x = kh are retained
The column “Rank” reflects relative sequential standing by frequency in English
Allophones are combined into a single summary value marked by symbol Σ
No English Türkic Rank Frequency No English Türkic Rank Frequency No English Türkic Rank Frequency
1 you -üŋ (pron.) 1 4.63% 73 jack cak- (v.) 257 0.05% 145 count köni 896 0.01%
2 I ič (es) 5.04% 74 old ol- (adj.) 264 0.05% 146 bill bil- (v.) 902 0.01%
3 to tu- (v.) 3 3.12% 75 money manat 271 0.05% 147 bill bilä- (v.) 902 0.01%
4 and (conj.) anta (conj.) 6 1.92% 76 son song 276 0.05% 148 short qïrt (adj.) 943 0.01%
5 that šu (pron.) 7 1.57% 77 girl kyr 286 0.05% 149 gift kiv- (v.) 967 0.01%
6 me min (pron.) 10 1.18% 78 world àbïl 299 0.04% 150 owe oye- (v.) 972 0.01%
7 in in (n.) 13 1.01% 79 hurt ur- (v.) 310 0.05% 151 bitch bi 977 0.01%
8 this šu (pron.) 14 0.95% 80 boy bo:y 311 0.04% 152 Earth Yer 990 0.01%
9 yes yah (OFris.) 89 0.90 81 while äwwäl 316 0.04% 153 box boɣ 993 0.01%
10 my -m 20 0.80% 82 kill öl- (v.) 323 0.04% 154 judge ayg- (v.) 996 0.01%
11 not ne (part.) 23 0.74% 83 hard qart(adj.) 324 0.04% 155 mama mamü 1013 0.01%
12 do tu- 24 0.74%   until anta 326 0.04% 156 Adam adam 1026 0.01%
13 be buol- (v.) 25 0.73% 84 car köl- (v.) 327 0.04% 157 bag bag 1029 0.01%
14 on on- (v.) 26 0.73% 85 yet yet- (v.) 335 0.04% 158 deep dip 1041 0.01%
15 was var- (v.) 28 0.70% 86 once ön (adv.) 339 0.04% 159 key kirit 1054 0.01%
16 we ös (pron.) 29 0.69% 87 second eki 350 0.04% 160 crime krmšuhn 1057 0.01%
17 so aša (adv.) 32 0.64% 88 truth dürüst 353 0.04% 161 joke elük 1069 0.01%
18 all alqu (n., adj.) 34 0.60% 89 face yü:z 355 0.04% 162 push puš- (v.) 1089 0.01%
19 are -ar (v., n.) 36 0.58% 90 cause köze:- 360 0.04% 163 boss boš (adj.) 1094 0.01%
20 will bil- (v., n.) 76Σ 0.58% 91 wife ebi 365 0.03% 164 seat čıj- (v.) 1120 0.01%
21 get qay- (v.) 42 0.48% 92 use tusu 367 0.03% 165 brain beini 1130 0.01%
22 yeah yah (interj.) 47 93 heart chäre 377 0.03% 166 hide qoy- (v.) 1131 0.01%
23 she šu (shu) 50 0.42% 94 many munča 386 0.03% 167 hide qujqa 1131 0.01%
24 can qan, kanata 51 0.41% 95 case kečä 393 0.03% 168 age aga 1142 0.01%
25 think saq- 54 0.39% 96 case qaza 393 0.03% 169 sell sal- (v.) 1149 0.01%
26 go git 60 0.38% 97 turn tön (v.) 394 0.03% 170 quit ket- 1150 0.01%
27 at at- (v.) 62 0.35% 98 trust dörs (t) 398 0.03% 171 faith vara 1157 0.01%
28 how qalï 63 0.33% 99 check chek 399 0.03% 172 board batɣa 1195 0.01%
29 see süz- (v.) 66 0.40% 100 means min 413 0.03% 173 kick kik- (v.) 1209 0.01%
30 come qam- (v.) 67 0.29% 101 brother birader 414 0.03% 174 cat četük 1217 0.01%
31 good kut 68 0.29% 102 ago aga (adj.) 417 0.03% 175 yep yah (interj.) 1226 0.01%
32 see süz- (v.) 69 0.29% 103 sit čıj- (v.) 424 0.03% 176 bunch bunča 1231 0.01%
33 as as- (v.) 71 0.26% 104 watch aɣtur- (v.) 433 0.03% 177 peace barısh 1237 0.01%
34 will bil- (v.) 75 0.58% 105 question kušku 458 0.03% 178 cash kečä 1260 0.01%
35 time timin (adv.) 78 0.25% 106 far ıra:- 461 0.03% 179 nose ñü:z 1263 0.01%
36 mean many (mahny) 83 0.23% 107 hit it- (v.) 482 0.02% 180 king kengu 1291 0.01%
37 tell tili (v., n.) 84 0.23% 108 child koldaš 500 0.02% 181 laugh gül- 1337 0.01%
38 hey ay (interj.) 87 0.22% 109 young yangi: 510 0.02% 182 smile semeye (v.) 1339 0.01%
39 yes yah (OFris.) 92 0.21% 110 fire bur- 525 0.02% 183 cost kı:z 1355 0.01%
40 been buol- (v.) 94 0.21% 111 shit šıč- 530 0.02% 184 sing siŋ- (v.) 1378 0.01%
41 some kim (morph.) 99 0.20% 112 cut kes- (v.) 540 0.02% 185 foot but 1380 0.01%
42 say söy- (v.) 102 0.19% 113 quite ked 541 0.02% 186 tree terek 1392 0.01%
43 ok ok (interj.) 103 0.19% 114 sick sık- (v) 544 0.02% 187 butt büt 1418 0.01%
44 take teg- 104 0.19% 115 sick sök- (v.) 544 0.02% 188 cry qïqïr- (v.) 1421 0.01%
45 us ös (pronoun) 107 0.19% 116 eat ye- (v.) 548 0.02% 189 guard qur- (v.) 1432 0.01%
46 make -mak 109 0.17% 117 bet büt- (v.) 595 0.02% 190 cake kek 1435 0.01%
47 too de (adv.) 114 0.16% 118 lie yalgan (v.) 599 0.02% 191 cup kap 1452 0.01%
48 sure sürek (adj.) 115 0.15% 119 dear terim 602 0.02% 192 taste tat- (v.) 1455 0.01%
49 over up- 118 0.15% 120 body bod 621 0.02% 193 land elen < el 1461 0.01%
50 man men 128 0.15% 121 worse uvy 626 0.02% 194 breath bu:r 1502 0.01%
51 uh yah (interj.) 133 0.14% 122 ass eš(äk) 655 0.01% 195 band ba- (v.) 1527 0.00%
52 much munča (adv.) 140 0.13% 123 secret soqru 672 0.01% 196 ought ötä 1545 0.00%
53 any ne: 141 0.13% 124 luck àlïč 675 0.01% 197 hall qalïq 1550 0.00%
54 give kiv- (v.) 148 0.12% 125 touch toqï (v.) 681 0.01% 198 bastard bas + tard 1552 0.00%
55 talk tili- (v., n.) 153 0.11% 126 cold xaltarä 693 0.01% 199 guest göster 1564 0.00%
56 God kut 155 0.11% 127 food apat 699 0.01% 200 jerk jul (v.) 1592 0.00%
57 stay üstü- (v.) 235Σ 0.11% 128 act aqtar- (v.) 738 0.01% 201 cousin qazïn 1604 0.00%
58 find yind- (v.) 159 0.10% 129 top töpü 742 0.01% 202 skin saɣrï 1613 0.00%
59 again aga (adj.) 161 0.10% 130 swear vara- (n.) 749 0.01% 203 dumb dumur 1662 0.00%
60 call qol 165 0.10% 131 black belä- (v.) 751 0.01% 204 hole ol- (v.) 1670 0.00%
61 feel bil- 174Σ 0.13% 132 less es- (adv.) 762 0.01% 205 bear ber- (v.) 1684 0.00%
62 first bir 174 0.09% 133 dog dayğa:n 769 0.01% 206 bear böri 1684 0.00%
63 other ötürü (adj.) 173Σ 0.09% 134 till til- (v.) 776 0.01% 207 scare qor 1704 0.00%
64 home qom 179 0.09% 135 till teg (adv.) 776 0.01% 208 gold al(tun) 1713 0.00%
65 put (v.) put- (v.) 186 0.09% 136 evil uvul- 782 0.01% 209 tie taŋ- (v.) 1724 0.00%
66 day dün 186 0.08% 137 eye ög- (v.) 787 0.01% 210 tad tat 1728 0.00%
67 keep kap- 187 0.08% 138 calm kam- (v.) 807 0.01% 211 sea si 1760 0.00%
68 stop top 191 0.08% 139 court qur- (v.) 818 0.01% 212 queen yeŋä 1791 0.00%
69 big big 204 0.07% 140 wake vak 833 0.01% 213 vampire ubyr 1799 0.00%
70 kind keŋ (adj.) 210 0.07% 141 message muštu 837 0.01% 214 coat gömlek 1802 0.00%
71 guess us- (v., n.) 217 0.07% 142 stick tik- (v.) 859 0.01% 215 color kula 1878 0.00%
72 care qorq 219 0.07% 143 write 'rizan (v.) 866 0.01% 216 master bögü: 1885 0.00%
  bad bäd (adj.) 220 0.07% 144 early ertä- 868 0.01%   above bava 1914 0.00%
  woman ebi- + men 263 0.07%             shock šok- (v.) 1951 0.00%
  dad dedä 226 0.06%                    
  mother mamü 228 0.06%                    
  baby bebi 234 0.06%                    
  father ata 236 0.06%                    
  might bögü: 240 0.06%                    
  mind ming 244 0.06%                    
  hell qalïq 248 0.06%                    
  own oye- (v.) 252 0.06%                    
  house quš 253 0.05%                    

Swadesh list

To gauge linguistic kinship of two lexicons, is used a Swadesh list of “basic words”, it provides some uniformity on the level of a linguistic gage. It is a kind of a canonical list of the IE “basic words”, ascending to the kernel of the hypothesized PIE language, supposedly found in the majority of the 439 IE languages, and in the majority of its individual branches. Based on the faulty Family Tree model, and based on what was held as specifically IE “proto-language” or a grouping of kindred languages, Swadesh list is conceptually wrong, it is actually a random collection of lexemes tailored for a special pleading case. This is a list that substitutes for a missing definition of the IE paradigm, a surrogate of the definition agreed to by the “consensus” of the IE linguists. It is a core fundament of the IE linguistics. The list was assembled on a faulty presumption that all languages use the same basic toolbox of grammatical nouns, verbs, adjectives, etc., and compatible syntaxes that allow them to be compiled into phrases. That presumption fits well into the bulk of the IE languages, but can’t be generalized for all earthly languages. Even within the IE family are oddballs that defy that presumption and add uncertainty to other traits contradicting their IE classification. Comparisons of languages deviating from the basic presumption with those that are in full accordance with the presumption encounter obstacles that necessitate introduction of additional presumptions to synchronise comparisons, Cf. languages where nouns can serve as adjectives vs. languages where adjectives are formed morphologically. The Türkic linguistic group falls into noncompliant category, comparisons are augmented by vaguely interpreted presumptive caveats. With that qualification, Swadesh list generally remains a useful, but far from universal, tool. But it is its randomness that makes it a quasy-universal tool for linguistic studies across the Earth languages. Swadesh list is routinely used as an objective tool to get a rough statistical assessment on similarities and differences between sampled languages. In the course of the present study, Swadesh list is used to objectively assess the Türkic contribution to the English language, and as a side benefit the Türkic contribution to the IE languages.

According to the Swadesh 207 count, the basic vocabulary of English has 113 or 55% IE *reconstructed roots, another 31 words or 15% are of dubious IE *roots, 63 words or 30% are definitely non-IE, and the combined dubious and positively non-IE words constitute 94 words or 45% of the Wikipedia “Appendix: Proto-Indo-European Swadesh list”. The heavy doze of non-IE component supplies ready candidates for statistically credible admixture. To be validated, these “essential basic words”, and the rest of the candidates need to undergo a scrupulous testing procedure suggested by G.Doerfer 1981. It has been estimated that the English lexis consists of 70% Romance lexicon, and the balance of Germanic, Celtic, and unknown. However, within the basic vocabulary of, say, 2000 words, Romance holds a considerably more modest place, and still smaller place in a base lexicon of, say, 1000 words, and a miniscule place in the usage frequency of the daily vocabulary. The above frequency test shows that a mere 10.75% or 216 words of the 2000-word basic vocabulary pull disproportionate 36.35% of the usage frequency, significantly contributing to the daily lexicon at the expense of the Romance components, while leaving enough room for the prehistoric Germanic and Celtic languages of the Northern Europe.

Numerous IE *roots come as close allophones of the real Türkic forms. The PIE reconstructions for these words create some phantom lexicon that would be nicely dubbed a Proto-Türkic (PT) lexicon (like the PIE “reconstruction” *h3okw for eye, eidetic to the real attested Türkic root ög-). A dictionary of PIE reconstructions composed of such real Türkic words would replace phantom fictions with real attested Türkic forms. In essence, that dictionary would attest to the Türkic origin of the IE languages within the scope of the Türkic part. The portion of such PIE basic words in the Swadesh 207 English list, plus the portion of the non-IE words of Türkic origin in the Swadesh 207 English list would provide a numerical approximation on the Türkic contribution to the English language. A fairly large proportion of these words permeate the IE Romance, Germanic, and Slavic branches of Europe, and in the IE linguistics is taken exclusively as an absolute criterion of Indo-Europeanism. It was keenly noted that in respect to the core vocabulary, the eastern IE languages fall far behind European languages. Since there is no definition of the IE paradigm, an eastern version of the Swadesh list may as well demonstrate that the western IE languages fall far behind the eastern IE languages.

Sample list of PIE “reconstructions” versus the Türkic originals
Numbers correspond to the conventional numbers on the Swadesh list
No English Türkic Proto-Indo-European
1 I ič (es) *éǵh2
8 that šu (pron.) *só
16 not ne (part.) *ne
19 some kim (morph.) *kaylo-
42 mother mamü *méh2tēr
43 father, dad ata (father)
dede (= daddy, dad)
*átta, *ph2tḗr
51 tree terek *dóru
74 eye ög- (v.) *h3okw
75 nose ñü:z *nā́s
80 foot but *pṓds
90 heart chäre *ḱḗr
93 to eat ye- (v.) *h1ed-
176 black belä- (v.) *bhleg-
178 day dün *dei-n
202 in in (n.) *(h1?)en

About 62 of the Swadesh list 207 words correspond to the respective entries in the 800-word listing of English Turkisms. Most of them do not have any hypothesized “*PIE proto-words”. That confirms the complex origin of English. The results are random coincidences between two independent listings, a complete examination would produce a higher number of matches. The number 62 is approximate because linguistic notions may be expressed by more than a single word, Cf. the notion “father”, with English forms father and dad (daddy) corresponding to the Türkic forms ata and dede, both matching the notion of the Swadesh list without a particular preference to the either word. In few cases PIE hypothesizes more than a single form, offering a choice to suit each of the different IE branches; at least some of the “PIE” forms ascend to the Türkic origin shown in the listing, Cf. ič – I – *éǵh2 and the like. The entries like this and that, differentiated by locative suffix and similar inflectional markers, may originate from a single form in languages like Türkic, where a series of locative suffixes form different locative aspects; such duplicative derivative entries can't be classed as “basic words”, making the valid words number less than 207.

The 62 Türkic–English correspondences in the Swadesh list constitute exactly 30% of 207 words, or higher accounting only for valid Swadesh entries. That number matches other lexical valuations of around 1/3 of the English daily lexis derived from the Türkic phylum.

Properties like phonetic and spatial dispersion, overlaid on the historical background extracted from written, archeological, and genetic dating, allow to narrow dating of the most components to not much earlier than the 1st millennium BC, the time of the reverse wave of the linguistically European refugees returning to the Central and Southern Europe, and the Sarmat influx into the Northern Europe. Such conceptual dating puts into question the much older hypothetical dating of the remaining PIE part of the list, the dating deduced nearly exclusively from the linguistic theoretical considerations. Similarly, some postulations concerning temporal changes within IE family are not applicable to the “reconstructions” of the non-IE admixtures, Cf. ič – I – *éǵh2 and the like. Applying to the case of English the attrition rate suggested as a valid rate for any language would increase the hypothetical fraction of English Turkisms at the turn of the eras to about 50%, a most questionable scenario. Interpolating observations for some cases to the entire class of words intuitively held as IE leads to erroneous “reconstructions” for all cases materially different from the cases underlying the temporal postulations. On the other hand, some unadulterated words, like the ata “father”, are also shared by some Amerindian languages, attesting to their existence prior to the 15th millennium BC Amerindian migration from Asia to the Americas. Such deep connections protrude far beyond the PIE temporal horizon.

The Swadesh list's Turkisms come in 3 flavors, either Proto-Indo-European “reconstructed” forms are some allophones of the attested Türkic form (20 cases, Cf. No. 1 ič ~ *éǵh22), or the English word does not belong to the PIE phylum (18 cases, Cf. No. 15 how), or the PIE “reconstruction” is unrelated to the English word (24 cases, Cf. No. 7 this ~ *kos), or even a combination of some two flavors. These etymological flavors extend far beyond the “basic words” vocabulary, with the edition of the dubious “echoic” origin, they are endemic for the rest of the 800 cited English Turkisms.

Türkic–English–PIE correspondences in Swadesh 207 list
Numbers correspond to the conventional numbers on the Swadesh list
ashes, belly, bird, bite, black, blood, bone, breast, burn, claw, cloud, come, die, dog, drink, dry, ear, egg, eye, fat,
feather, fish, fly, foot, ful, good, green, hair, hand, head, hear, horn, knee, know, leaf, lie, liver, long, louse, meat,
moon, mouth, name, neck, new, night, one, person, rain, red, road, root, round, sand, seed, seed, sleep, small,
smoke, stand, star, stone, Sun, swim, tail, thou, tongue, two, walk go, warm, water, what, white, who, woman, yellow
No Dybo No/p. English Türkic Proto-Indo-European Notes
1 42/335 I ič (es) *éǵh2  
2   you (singular) -üŋ (pron.) *túh2  
4 95/520 we ös (= us) *wéy  
7 86.3/500 this šu (pron.) *kos PIE unrelated to English
8 85/498 that tet (pron.) *só  
15   how qalï   English unrelated to PIE
16 62/416 not ne (part.) *ne, *ney  
17 1/16 all alqu (n., adj.)   English unrelated to PIE
18 52/370 many munča (adv.)   English unrelated to PIE
19   some kim (morph.) *kaylo- English unrelated to PIE
21   other ötürü (adj.) *h2el-yó- PIE unrelated to English
27 5/116 big big   English unrelated to PIE
30   thick sik *bhenǵh PIE unrelated to English
33   short qïrt (adj.) *mreǵhú- PIE unrelated to English
36 99/527 woman ebi + men   English unrelated to PIE
37 51/367 man (adult male) men *h2nḗr PIE unrelated to English
39   child koldaš   English unrelated to PIE
40   wife ebi   English unrelated to PIE
42   mother mamü *méh2tēr  
43   father dede (= daddy, dad) *átta, *ph2tḗr  
47   dog dayğa:n *ḱwṓ PIE unrelated to English
51 90/509 tree terek *dóru  
53   stick tik- (v.) *ǵhasto- PIE unrelated to English
58 3/87 bark (of a tree) ver   English unrelated to PIE
62 75/470 skin saɣrï *pel- PIE unrelated to English
74   eye ög- (v.) *h3okw-  
75 61/409 nose ñü:z *nā́s  
77 89/506 tooth tiš *h3dónts PIE unrelated to English
80 31/270 foot but *pṓds  
90 40/330 heart chäre *ḱḗr  
93 23/232 to eat ye- (v.) *h1ed-  
95   to suck saɣ- (v.) *dheh1(y)- PIE unrelated to English
101 72/455 to see süz- (v.) *derḱ- PIE unrelated to English
104   to think saq- *tong- PIE unrelated to English
110 43/337 to kill öl- (v.) *gwhen- PIE unrelated to English
113   to hit it- (v.) *pleh2k- PIE unrelated to English
114   to cut kes- (v.)   English unrelated to PIE
122   to come qam- (v.) *gwem-, *gweh-
124 74/468 to sit čıj- (v.) *sed-  
126   to turn (intransitive) tön (v.)   English unrelated to PIE
128 33/278 to give kiv- (v.) *deh3- PIE unrelated to English
130   to squeeze qis- [qys-] (v.)   English unrelated to PIE
135   to push puš- (v.)   English unrelated to PIE
137   to tie taŋ- (v.)   English unrelated to PIE
138   to sew suk- (v.) *syuh1-  
139   to count köni   English unrelated to PIE
140 71/451 to say söy- (v.)   English unrelated to PIE
141   to sing siŋ- (v.) *kan- PIE unrelated to English
146   to swell siwel   English unrelated to PIE
154   sea si   English unrelated to PIE
158   dust doz *pers- PIE unrelated to English
159 22/224 earth yer *dhéǵhōm PIE unrelated to English
160 14/195 cloud bulut *nébhos PIE unrelated to English
167 28/260 fire bur- *h1égnis PIE unrelated to English
171 55/378 mountain mün- (v.) *gwerh3- PIE unrelated to English
176   black belä- (v.) *bhleg-  
178   day dün *dei-n-  
181 15/192 cold xaltarä *gel-  
184   old ol- (adj.) *senh1ó- PIE unrelated to English
186   bad bäd (adj.) *h2wap- PIE unrelated to English
191   sharp (as a knife) šarp (adj.) *h1edh- PIE unrelated to English
198   far ıra:- *wi  
201   at at- (v.) *h1ad  
202   in in (n.) *(h1?)en  


Comparing lexicons and trying to get to the prehistoric level is fraught with lots of white noise. Enthusiastic linguists easily fall prey to their gusto, and the IE paradigm provides lofty examples of that, in loving hands the noise of homonyms and conflations is confused with cognates, mistaking a chance consonance for linguistically meaningful coincidence, and creating a path for phantom PIE concoctions. The process biased with contracted horizon is leaving untoward cognates unexplored. Etymology has two components, a descent from a root and a path that reaches the subject word. A focus on the root leaves out an implied or vaguely presumed path, but it is the path that determines diffusion and development of the word. Circular logics brings circular results, an incorrect premise leads to unsound conclusions, usually salient due to unrealistic semantic stretch. Combined with the simplistic Family Tree model, the wayward finale is inevitable. Circular results obscure the path, spread, and development, obscure distinct tell-tale transformations inflicted in propagation along contrasting Sprachbunds, and in the end lead to a result opposite from that intended. Instead of illuminating very specific linguistic aspects of the history, false etymology smothers them into historical mythology. Lacunas in the records, especially beyond the range of the literary purview, leave some candidates unprovable and undatable except for purely linguistic evidence, like the words bode, boss and vampire.

Linguistics has devised a system of checks and balances to filter the noise out. Computer literacy brings acceptance of mathematical methods to linguistics, abhorred by the old linguistic schools. For quantitative analysis of the established kinship in the lexicon is used Swadesh method, equally applicable to the romantic Genetic Tree and the Wave models. It establishes statistical correlation between different languages using a standardized linguistic test, shedding some light on the linguistic development. Reasonable lexis-related criteria for establishing kinship were formulated by G.Doerfer 1981 , he gave a definition to the perennial linguistic bewilderment of what exactly is necessary to firmly establish genetic relations between two different languages. Evaluation of statistical chance resemblances is offered by M. Rosenfelder 2002 . These criteria rarely apply to morphology and other key linguistic properties, but with consistent transparency in application and similarity in function, it would be difficult, for example, to deny morphological continuity between the Türkic aɣrïüt, English aggravate, and Lat. aggravare, or Türkic baiyar, Russian boyar, and Indian Boyar caste. In all fairness, each etymologized word should be assigned a credibility or confidence weight, equally applied to the proposed IE and non-IE etymologies. Such weighting would allow comparison between alternate etymologies, and would enable to calculate a Bayesian probability for the suggested etymologies for each set of the words.

The concept of paradigmatic transfer defines borrowing of some entire complex of features. In the past few centuries, paradigmatic transfer became a universal tool of acquiring terminology with technology, at first medical and biological terminology based on Latin and Greek, then industrial terminology from industrialized nations, and lately the computer terminology from the American English. On a different level, the paradigmatic transfer may have been a driving force of human evolution since the earliest times. Any complex of linguistic traits borrowed entirely of partially is an example of paradigmatic transfer: typology, syntax, morphology, script, and lexicon; in extreme case nearly entire language is transferred, Cf. Latin American versions of Spanish. Statistically, a probability of chance independent invention of a paradigm is negligible, and any appeals to chance coincidence not supported by evaluation of numerical probability carries no weight, because any additional element, however minor, reduces the value of chance coincidence probability by orders of magnitude. Since civilizations developed not as much on inner innovations as on cultural borrowings, the examples of paradigmatic transfer are innumerable: Lat., Gk., and Türkic terminology was borrowed complete with morphological modifiers, morphological systems, entire alphabets and entire letters, partial alphabets, lexemes and their calques, idioms, folklore, practically any aspect of civilization. The phenomenon of paradigmatic transfer is a powerful diagnostic tool in cases when a mechanical transfer can be positively excluded (Cf. alphabet, economy, ideas); a transfer that necessitates a demographic component (Cf. morphology, alphabet elements, idioms) attests to a -strate model, such as substrate, adstrate, superstrate, etc. A paradigmatic transfer of entire phrases is one of indelible diagnostic parameters.

Numerous phrases in English, and by association in other European languages, are close allophones of the real Türkic forms. Practically, they are a word-for-word re-formulations of the Türkic phrases. The statistical significance of such allophonic phrases is enormous, since each additional phoneme reduces chances for accidental coincidence by orders of magnitude, making genetic connection obvious. The following is some sampling of allophonic phrases taken directly from EDT and similar works, with retained transcription and punctuation. In more than few cases, the literal and literary meanings are identical, a separate literary form is not needed, since the Türkic phrase is transparent without much elaboration:

Phrase Literally Literarily
belin badı belly his he banded his belly he banded (girded)
oldačı kiši old geezer  
öčür kül charcoal quenched coals
ne yedi (ne yemadi, -ma- negation) not eat  
ne tedi (ne temadi, -ma- negation) not told (tell)  
tal bodı tall (and slim) body  
tö:š ba:ğı: teat bag bra, breast binder
yuğrut koyuldi: yogurt coagulated  
ğır aŋdï- greyhound grey ambusher
tıš ağrığ tooth aching  
tiši erdi teeth are  
kutluğ kıvlığ God given  
ič sök- I (am) sick  
yer kaba earth cave earthen cave
ačığ àčï acidic ache bitter ache
tarka àčï tarty ache bitter ache
topildi: yer toppled (kicked) earth earthquake
kurultai be cured (family) ties restore family ties
ınča: munča inch (less) or more more or less (idiom)
ınča: bunča inch (less) or bunch more or less (idiom)
ınča kutluğ kıvlığ inch (less) by God given deprived by God, not so fortunate
kır(mu) durur girl durable virgin daughter
bıšığ siŋir boiled sinew steamed sinew
meŋgü: ber- mengir bear bore (convey) a mengir (gravestone) (for somebody)
ne qayar (qorq) sen (or seni) no care (in) thee
[ni kara þuk (Goth.)]
Thee (do) not care
meni: tonatti: don on me (dress, hat)  
ämin qamuɣ amån (OK, assured) gamut gamut (is) amån (assured)
azı:ma:k ooze make make (to) ooze
küp azi:di: cup oozing cup (is) oozing
jolda ämin jorït on the yule (on the road) amån journey on the yule (road) amån (assured) journey
tilge: ye:r till (furrow/strip) of earth strip of earth

Few examples of extended phrases would illustrate the concept. An arsenal of about 800 words in the present compendium, constituting statistical 35% of the English daily vocabulary, allows to compose many types of phrases where English text is closely mirrored in the Türkic text, recognizably resembling each other in spite of incompatible syntaxes and discrepant morphology. Illustrations are staple phrases, cited in English, Türkic, and German, Turkish, or Slavic (Russian) adjusted for the modern Türkic syntax.

  I + to be + verb    
English I am thinking I am aging I am argue(ing)
German Ich (bin) denke Ich (bin) Alterungs Ich (bin) argumentiere
Türkic Ich (bol) thakip tur [saqip ~ thakip] Ich (buol) ag Ich (bol) arqu- (tur)
Turkish Ich (bol) düshün(üp) (tur) Ben yašlanma am Ben savunarak am


Eng. to be or not to be – this is the question I do argue, making others feel bad
Türk. bul(mak) ya da bulma(mak) - ište alqu (gamu) kušku bu Ötürü bäd hissettiren, ben arqu(mak)
Slavic. áûòü èëè íå áûòü - åñòü âñå ñîìíåíèå âîò Äðóãèõ ïëîõî ÷óâñòâîâàòü äåëàÿ, ÿ àðãóìåíòû äåëàþ
Sl. transl. to be or not to be - is all (gamut) question be Others bad feel making, I argue(make)

Lining up word for word, the English-Türkic-Slavic match looks like this. Few words shown in bold do not have direct correspondence. Square brackets stand for words indicated by agglutinated morphology, round brackets enclose suffixes and suggest synonyms. Experimenting with other Gmc. languages would bring somewhat resembling results.

  to be or not to be – this is the question
With Türkic syntax to be (be'd) or not to be is all (gamut) question be
Türkic transliterated bul(mak) ya da bulma(mak) ishte alqu (gamu) kushku bu
Russian Cyrillic áûòü èëè íå áûòü åñòü âñå (ãàììà) ñîìíåíèå âîò
Russian Roman byt ili ne byt est vse (gamma) somnenie vot
  I do argue, making others feel bad
With Türkic syntax others bad feel making I argue do
Türkic transliterated ötürü bäd bil- [mak] [Ich] arqu(mak) tu(mam)
Russian Cyrillic äðóãèõ ïëîõî ÷óâñòâîâàòü äåëàòü ß àðãóìåíòû äåëàò
Russian Roman drugih ploxo chuvstvivat delat ya argumenty delat

Except for conjunctions, all other words are allophones of Türkic lexemes, with some shifts: Türkic gamu “gamut, all” in English is expressed with synonymous all, and in Russian with IE vse, while the Russian allophone of the Türkic gamu is spelled gamma and designates the whole, all octave; Russian somnenie is a derivative of Türkic ming (miŋ) “mind” and an allophone of English mind with stem mn and IE prefix s-/so- “with”.

A cross-comparison of Slavic, Baltic, and English Turkisms helps to detect less obvious Turkisms in English. Dating is problematic, it is vaguely anchored to the known historical events or archeologically detected known historical events, confirmed by isotope and genetic allele dating. Within the holistic body of the evidence, genetics attest that Germanic people of the European Sarmatia are the same people who originated from the Sarmatian area in the heart of the Eurasia, archeology and genetics assert that with various Kurgan waves they spread out to flood the same area of the Western Europe, they introduced to the Western Europe the identical peculiar runiform alphabet with common graphical symbols and some preserved common phonetics, they share hundreds of common words, and they retained legendary memories of their Asian origin. Under the weight of the evidence, any branch of science would find it scandalous to even question a common origin.


The Old English (OE) is a euphemism for the Anglo-Saxon language, used to create periodization without turning to the historical people who brought the English over to the isles. Whether called OE or Anglo-Saxon in the etymological references, it is still the same old Anglo-Saxon language, documented mostly between 700s and 1050s, after the encroaching of Christianization, and before the Norman conquest. At its dawn, the Anglo-Saxon English used runic script; a modern codified version, that probably does not include all real non-codified graphemes and all sounds peculiar to the languages of its creators, still contains a number of graphemes that display a genetic connection with the Horezmian (Turanian) and Orkhon versions of the Türkic alphabet, and also contains few Roman or Greek letters. The formal transition of Anglo-Saxon literature from the runic to the Latin alphabet took place in the first two centuries in the new lands. Before its migration to the British Isles, runic script was a means of written communication in Scandinavia and on the continent, its monuments are engraved on gravestones and stones, metal, and rare perishable relicts that survived the period of severely enlightening Christianization.

The term Anglo-Saxon in the England's context refers to the conglomerate of tribes that moved to England in force, over extended period, from their initial location at the junction of modern Moravia, Germany, and Poland. As their name implies, initially it was a union of two major components that likely absorbed numerous local fragments, of which Gothic left a detectable trace. The union was further enriched in their location along the southern banks of the Channel, of which Frisian left a detectable trace. The contributions of the local vernaculars are not always readily identifiable, and in dictionaries are cited under an indiscriminate Anglo-Saxon label. Probably, the motley composition of the migrants helped in some standardization of the island's runic alphabet before the illiterate monks denounced it as some type of supernatural satanic device. In view of the historical longevity in the correspondence between the shape of the graphemes and their morphemes which allows tracing of the alphabets to their most remote ancestors, the morphemic reshuffling of the runic alphabet from its original and traceable composition to the haphazard sequence of the Futhark happened in the darkest period of the early Dark Ages. The Germanic Futhark appears without a prehistory, compiled in a rigidly structured form, with its morphemes mechanically reshuffled without any visible taxonomy, as a work of some invisible intelligent design. Stalin and Mao were not the inventors of the methods to mass-convert literate people into illiterate. The oldest runic inscriptions, composed before the miraculous reshuffling, remain systemically undeciphered, and held as written in unknown languages. The later runic inscriptions, composed according to the Futhark sequence, are readily deciphered, and even freely read by scholars steeped in Futhark.

The discovery of the Germanic-looking runiform script in the far-away Siberia was an exciting news in the 18th c. The discoverers were struck by the similarity of the Orkhon script with the Germanic runes. Graphical similarity was so strong that the European zealots had to re-define the very term “runes” as solely “Germanic runes”, leaving other runes to come up with their own terms. The trademark patent did not expire in 17 years, various encyclopedias still define runes as “Germanic runes”, a unique case of copyright protection extending more than 2 millennia backwards. The oldest runic inscription was carbon-dated by the 5th c. BC, it is the Issyk inscription found in 1970 in the nomadic kurgan in the vicinity of the lake Issyk in Kazakhstan, the homeland of the Saka Scythians according to the historical records. The Orkhon discovery hibernated for two centuries, till the Finnish scholars started exploring a possibility of the Germanic runes written in an unknown language with some connection with the Finnish past. Of the Scandinavian people, only Finns came from Asia, they could have brought the runic script to the Northern Europe. Finnish scholars attracted attention of all Scandinavian paleographers, they were the first who published a compendium of the Siberian inscriptions. In 1893 the Dutch Vilhelm Thomsen (1842 - 1927) deciphered the bilingual Türkic-Chinese inscription from Orkhon in Mongolia. After that, the interest to the Türkic script for another half-century again went into hibernation. Any links between the Germanic and Türkic runes were repudiated, any similarity was denounced as fortuitous, any genetic connection as chimerical.

Graphical-morpheme similarity
(Ref. Mukhamadiev Azgar, Ancient Coins of Kazan, Kazan, Tatar Publishing house, 2005, ISBN 5-298-04057-8)
Lat. Turanian letters Türkic letters Unicode
Anglo-Saxon Lat. Turanian letters Türkic letters Unicode
à, ä î, è
č p
i s1
j1 - t1
l1 th -
m Tur. x1 -

Even more striking is the cross-comparison of the whole alphabet sets. Each grapheme of the Anglo-Saxon alphabet finds an exact counterpart in the Orkhon alphabet, with seemingly random shuffling of phonetics. The northern Europeans with high presence of Turkisms in their languages managed to independently invent, sometime at the turn of the eras, a whole set of symbols totally identical to another alphabet used in the Middle Asian coins of the turn of the eras and in the 8th c. Türkic Kaganate. Such a coincidence between two alphabets that reportedly have no connection whatsoever, and are localized at two different ends of the Eurasia, must be ascribed to no less than a miracle, it can't be sanely explained within the framework of the prevailing postulates. If it can be even more miraculous, the shape þ of the grapheme for the voiced interdental th (ð, the δ of the OTD), not found in other alphabets, is identical in the Middle Asian (Turanian) and the Germanic Futhark alphabets. Even today, the depictions of the Latinized Gothic, ONorse, and Anglo-Saxon words use the grapheme þ, first depicted on the Middle Asian coins with Türkic legends.

Statistically, a chance coincidence of the same form and sound from two unrelated alphabets, selected from a limited subset of the forms of all alphabets compiled by humanity during its whole history, is so small that it would require lots of paper to place all preceding zeroes. The long-denounced connection between the Türkic alphabets and the Futhark alphabet is undeniable, the question is only on the mechanism of that connection. Modern forensics asserts that perfect crimes do not exist, any criminals leave traces beyond their capability to wipe them clean. It is only a matter of the willingness and ability of the investigators to to find and analyze the traces.

Futhark and Orkhon scripts
(cited in: Halvorsen I.,2004, http://www.sunnyway.com/runes/origins.html)

Latin and Türkic Languages

Numerous English words are ascribed to the Latin and Latin via French, implying that either the English did not have words for such concept at all, or that the Anglo-Saxon (OE) lexicon was fully supplanted. Numerically, the Romance strata in English is estimated to be on the order of 70%. However, an assessment of the set of 800 Türkic-English cognate words cited below finds that
– much of the corresponding Latin vocabulary is related to the same Türkic substrate that formed the English lexis. Historically, such continuity is consistent with the thesis that most of the “Old Europe” population was vanished by the 3rd mill. BC, and from the 3rd mill. BC to the 1st mill. BC it was replaced by various Türkic populations and their varieties of the Türkic languages.
– a very significant portion of the “Latin” lexicon belongs to the Vulgates, Late and Middle Latin, and is Latin only in name. Although Latin is cited as a dead, frozen language, the cited words belong to a living language. The Turkisms used by post-Classical Latin include some significant proportion that originated from the nomadic nations - Vandals, Burgunds, Alans, etc., and from the Germanic nations, including Anglo-Saxons, Goths, Lombards, and others that influenced the local Latin vernaculars. For centuries, the academical and ecclesiastical Latin co-existed along the Vulgate Latin languages of various nations.
– a portion of the “Latin” lexicon originated in the Gallic space, it is usually denoted as OFr. fr. Lat., where Lat. is an indiscriminate blend of the Classical and post-Classical Latin, and where Burgund and Alan words were assimilated into post-Classical Latin.

An initial trial of the English-Latin-Türkic lexical correspondences, using a preliminary 400-word list of the English-Türkic correspondences, found a match for 127 words, producing a combined 32% of the sample. That number couldn't be applied to the whole body of the Romance in English, which is expected to be much diluted, and be smaller by an order of magnitude. Results of the initial trial indicated that arbitrarily allowing a conservative 10% for the remaining Latin lexis, about 40% of the Latin lexicon is retained or is based on the lexical base that originated with the Türkic linguistic family. The results were fairly reasonable, and should fall into that ballpark number at any similarly structured test, even if some correspondences are disqualified.

A consequent test cited below, using the 800-word list, found a match for 334 words, or 41% of the sample (Table 2a). The result of the expanded comparison agrees within reason with the result of the initial trial. The test covers apparent Turkisms in Latin; it is a statistical indicator of the Türkic-Latin lexis vs. the Türkic-English 800-word list. In respect to the Latin lexis, it is a random lexical selection: 334 words of a random 800-word list. This number can't be applied to the whole body of the Romance in English, which is expected to be much diluted by various admixtures, and be smaller by an order of magnitude. In the context of the Türkic-Latin-English commonality, the number 41% is significantly exaggerated by dubious Lat. cognates. It can be inferred that without Vulgates', Middle and Late Latin words, and dubious cognates, the proportion of the Turkisms of the Classical Latin origin would be reduced by half, to about 20% of a random sample. Allowing another 5-10% to cover the other Latin Turkisms from the remaining portion of a typical 10,000-word Latin vocabulary, a very rough prediction can be made that about 25-30% of the Classical Latin lexis hails from the Türkic phylum. The numbers in English and Latin are quite compatible, difference is but in the degree of admixture.

Visible indicators point to the Latin and English Turkisms originating from much different versions of the Türkic vernaculars, likely separated by timespans measured in millennia, relayed by different sources, planted on completely different substrates, and compared with the geographically far remote OTD/EDT lexis recorded for a time period one millennium later in the English case, and a few millenniums later in the Latin case. The Latin lexicon of the Türkic origin was superimposed, supplanted, and conflated with the English innate lexicon of the Türkic origin, forming a local version of something like a European Türkic-based Sprachbund layer.
12, 13, 14

In all cases, a borrowing from Latin and English into Türkic is positively impossible, especially so in the case of the Central Asian and Far Eastern Türkic languages. In case of Uigur, for example, Uigurs are continuously attested in the Central Asia-Far Eastern region from the 3rd c. BC, before the rise of the Roman Empire on the other end of the Eurasia. Numerous Türkic tribes are attested still further east of the Uigurs. Neither the Romans, nor the English possessed the mobility of the mounted Türkic tribes, used the steppe belt as a transportation corridor, or are known for their mass migrations across Eurasia to effect such borrowing.

Table 2a. Türkic–Latin–English lexical correspondences
No English Latin Türkic English Latin Türkic English Latin Türkic
1 abode habitatio oba cure cura kur- (v.) nascence nasci ña:š
2 abundant abundantem abadan (adj.) curt curtus qïrt (adj.) nose nasus ñü:z
3 abysm abysm abamu curve curv- qarvï (adj.) not non ne (part.)
4 access accessus ačsa:- dad tata, atta dedä obturate obturare tiy- (v.)
5 acid acidus àčï- (v.) day dies dün ocean ocean(us) ӧkän
6 act actus aqtar- (v.) deliver deliber döle- (v.) ogle oculus ög- (v.)
7 age aetas aga dementia dementare dumur old altus ol- (adj.)
8 aggrieve aggravare aɣrï derma derma deri omen omen aman (adj.)
9 agile agilis ačïl descend descendere düšen (v.) on an- on- (v.)
10 aid adiuvare jarï divide divide dil- onus onus önüs (adj.)
11 alimentation alimonia alım durable durabilis dür- (v.) orate oratio orı: (n.)
12 alimony alimonia alım duration durationis dür- (v.) ortho- ortho- örti- (v.)
13 all omn- alqu (n., adj.) duress duriciam dür- (v.) other alter ötürü (adj.)
14 alms eleemosyne almak ea aqua- aq- (v.) otter lutra ätär
15 amorous Amor amran- eat edi ye- (v.) owl ulula aba(qulaq)
16 anger angustus özak (adj.) elbow ulna  el pan patina ban
17 anguish angustus özak (adj.) eligible eligibilis elïg- (v., n.) papa papa baba/babai
18 apian apis arï elite eligere elga- (v.) peace pax barısh
19 apt aptus apt elk alces elik period periodus ö:d
20 arch arcus arca elm ulmus ilm pot potus patır
21 ardent ardere arzu (n.) en- in- en- (v., prepos.) prior prior ür
22 argue argutare arqu- (v.) endure durare endür- (v.) purge purgare pür- (v.)
23 arrogant arrogantia orı: enge angustus özak (adj.) purl burra bu:r- (v.)
24 Arthur Arturius artur- (v.) engine ingenium ïjïn- purse bursa bursaŋ
25 asp aspidem äväs enigma aenigma tanığma push pulsare puš- (v.)
26 ass asinus eš(äk) equal aequalis egil (adj.) pyre pyra bur- (v.)
27 assign assignare asïɣ ether aether äsir quality qualitas qïlïɣ
28 astute astutus asurtɣuq (adj.) Europe Europa ev + opa quantity quantitas köni
29 at ad at- (v.) evacuate evacuare evük- (v.) quarrel querella qaršï
30 Augean Augeas aqür evict evictio evük- (v.) quilt culcita kübil
31 augur augur ay- (v.) evoke evocare evük- (v.) quit quietus ket-
32 aurora aurora yar- Eve ava eve ration rata ruzi- (v.)
33 axle axis i:k ewe ovis, ava eve regal regalis arïɣ (adj.) 
34 barge barga bart (adv.) exhaust exhaurire qoxša- (v.) sack saccus sak
35 bark barca barq eye oculus ög- (v.) sagacity sagax sag
36 barley far arpa, urba face facies yü:z sage sage sag
37 be fui buol- (v.) faith fides vara salary salarium salɣa (v.)
38 bear fero ber- (v.) false falsus al- (v.) saliva saliva liš
39 bear ferus böri far per ıra:- sanitary sanitas esan (adj.)
40 belt balteus bel fart bombulum burut- (v.) sanity sanitas san- (v.)
41 board bordus batɣa father atta ata sapient sapere savan (adj.)
42 boil bullire bula- (v.) feel palpare bil- sapphire sapphirus sepahir
43 bore forare bur- (v.) fire pyra bur- satisfy satisfacere satsa- (v.)
44 box buxis boɣ first primus, prae bir satyr satyr satir
45 brother frater birader fissure fissura öz savant sapere savčï (v.)
46 bruise brisare bürt, bert flask flasconem baklaga say (in)seque söy- (v.)
47 brute brutus börü food pabulum apat sconce sconce quč-
48 bulge bulga beleg (n.) foot pes but seat sedess čıj- (v.)
49 bull bovis buqa frog varde baga secede secedere ses- (v.)
50 burl burra burnï gene genus gen- (v.) second sequi eki
51 bursary bursar bursaŋ genu genu yinčür- (v.) sector sector chektür
52 cabbage caput qabaq George Georgius urï seize sacire sız- (v.)
53 callous callus qal (adj., v.) get prehendere qay- (v.) sepia sepia sepi- (v.)
54 calm cauma kam- (v.) glue gluten yelïm sever separare sevrä- (v.)
55 calumny calvi čulvu glut gluttire oglït- (v.) sew su(ere) suk- (v.)
56 can canna kanata gluten gluten yelïm shade   čadïn
57 candle candela kandil goat caper käči sharp scalpellum šarp (adj.)
58 cap cappa kap gore cruore göres- (v.) sin sons cin (jin)
59 capture captura hapset grind frendere qïr- sip sipho, suppa syp (v.)
60 capuche kapuce kapšon  guest hostis göster sit sedere čıj- (v.)
61 car carrus köl- (v.) hash ascia ash so suad aša (adv.)
62 care cura qorq heap chupa kip sock soccus sok- (v.)
63 carpus carpus qarï heart cor chäre son sunus song
64 case capsa kečä herb herba arpa: stick instigare tik- (v.)
65 case casus qaza hernia hernia urra suave suavis šuvlaŋ
66 castigate castigare kast- (v.) hey eho ay (interj.) suck sugere saɣ- (v.)
67 castle castrum kishlak hilarious hilaris güleryüz (adj.) sure securus sürek (adj.)
68 casualty casus közün- hoopoe upupa üpüp susurrate surdus šar šar (v., n., adj.)
69 cat catta četük I ego ič (es) suture sutura sač
70 category categoria qatïɣ (adj.) ideal idea edil (adj.) swear verus, fides  vara- (n.)
71 cause causa köze:- idyl idyllium edil (adj.) tab tabula tap- (v.)
72 cavalry caballus keväl ignite ign(ire) yaq- (v.) tablet tabula tü:b
73 cavern cavus kovı: inch uncia ınča take tolle tut- (v., n.)
74 cavity cavus kovı: itinerate iter ïd- (v.) tally talliare tili- (v., n.)
75 cemetery coemeterium semäklä- (v.) jar jaru- jar- (v.) tariff tarifa tarïɣ
76 chagrin gravus qadɣur jelly gelu yelïm taste taxare tat- (v.)
77 chalk calx chol joke iocus elük tavern taberna tavar
78 chastise castigare kast- (v.) journey diurnalis jorï (v.) tend tueri taya
79 cheap caupo čıp (adj.) judge iudex ayg- (v.) terrain terra ter- (v.)
80 check scaccarium chek juice ius testament testis tutsuğ
81 cherub cherub čebär Kent Cantia keŋit- (v.) testicles testis tasaq
82 chill gelidus čil key clavis kirit theriacum theriacum tiryak
83 chip cippus čïp kin gnasci kin/kun/kün throne thronus tören
84 chisel caesus čiz- (v.) kitchen coquina qatna- till tegula til- (v.)
85 circle circulus sürkülä (v.) language lingua luɣat toilet tela tölet
86 coagulate coagule qoyul- (v.) leak libarå liš tire (n.) ? teyir
87 cob caballus kev- might magus bögü: toll tolonium tol
88 coffin cophinus kovı: mallet malleus maltu tomb tumba tumlu
89 cold gelu xaltarä mama mater mamü too etiam de (adv.)
90 collect colligere kölar (v.) mammall mamma meme tor torus tärä
91 colon colica kolon marasmus marasmus maraz torah Torah tör
92 color color kula master magister bögü: touch toccare toqï (v.)
93 colossal colossus qolusuz me me min (pron.) tower turre türma
94 com- com-, cum kon- (n., adj.) mental mentalis meŋtä (adj.) tremble tremere četre (v.)
95 con concipere qun- (v.) menu mand(ucare) meŋ tuber tubus tü:b
96 con- co-, con- kon- (n., adj.) message missus muštu tumulus tumulus tumlu
97 confer conferre ber- (v.) mickle magnus  mig (n., adj., adv.) turn tornare tön (v.)
98 cook coquere kok- (v.) milk målñà meme udder uber ud
99 copious copia köp (adj.) mind mens ming ululate ululatus ulï- (v.)
100 cork quercus kairy mint moneta manat unite unus una- (v.)
101 cost cost kı:z mock modus -mak us nos ös (pronoun)
102 count computare köni model modus -mak use usus, uti, oeti tusu (v., n.)
103 courage cor kür (adj.) moisture mucidus mayi usher ostiarius üšer (v.)
104 cove cavus kovı: monastery monasterium manastar vacate vacare evük- (v.)
105 cowl cuculla kalpak money moneta manat vacuum vacuum evük- (v.)
106 coy quietus köy- (v.) mother mater mamü valerian valerianus pultäran
107 crime crimen krmšuhn (v.) mount mons mün- (v.) voe vah, vàå uvy (interj.)
108 crow corvus karga mountain mons mün- (v.) vouch vocitare buč- (v.)
109 crust crusta kairy mouse mus muš voucher vocitare vučuŋ
110 cry quiritare qïqïr- (v.) munch manducare meŋ wake vegere vak
111 cull colligere čul- (v.) muscle musculus muš watch vigil aɣtur- (v.)
112 cup ciphus kap       worse vah, vàå uvy (interj.)
              yet sed yet- (v.)
              young juvenis yangi:

The quite substantial presence of Türkic–Latin–English lexical correspondences mocks the standing thesis that the Centum language group is devoid of the Türkic presence. The Centum group includes Italic, Germanic, Celtic, and Hellenic languages, and besides English and other Germanic languages, numerous cognates of the Türkic stems belonging to the Italic, Celtic, and Hellenic languages. Originally, the thesis may have reflected the true extent of the knowledge of its authors, but the continuous recitation of the old thesis must be an embarrassing display of orthodox beliefs. The conundrum is not amendable: albeit in the past two centuries numerous modern languages underwent ethnic cleansing, it is impossible to mechanically cleanse the ancient Greek or Latin, their Turkisms have been sown wide and deep, and grew deep roots, and they provide indelible evidence.

Sanskrit and Türkic Languages

The IE linguistics holds Sanskrit as an IE language; the traditional IE theory claims its age at about 1700 - 1500 BC; one of the pillar tenets at the foundation of the IE theory was, and that is still holding, the great antiquity of Sanskrit. Sanskrit was and continues to be held as closest to the PIE language. The opposition within and without IE linguistics disagrees with every tenet. Long lists of non-IE Vedic and Sanskrit words were compiled, similar to the Germanic and Russian non-IE listings, believed to be of non-Aryan origin. Were identified extensive lexical, phonological, structural, and morphological correlations between the old and modern Indo-Aryan languages and the non-IE languages of India, mainly Dravidian and Munda. Indian scholars challenge the Eurocentric classification of the Sanskrit as an IE language, asserting that it is a product of socio-politico-cultural circumstances. The observed lexical and structural similarities are hardly traceable or attributable to borrowing, convergence, etc. Some opposition supports a concept of “South Asia linguistic area” covering various Indian languages and language families.

Historical background attests to more than 2 millennia of the Türkic presence in the South-Central Asia, with a long list of ethnicities and polities, starting with Indo-Scythians and ending with the Moghuls, from whom Britain wrestled control of India. The linguistic traces of the Türkic presence are indelible in the Indian languages, Cf. the Indian hallmark sari, the Türkic “wrapped”. In that respect, Sanskrit is consistent with the other Indian languages.

In the linguistic aspect, historical background should also question the linguistic cohesiveness of the Indo-Aryan farmers. No mass migrations can happen in one humongous swoop. Any migratory process has a wave character, with reciprocal movements that involve gradually increasing flows from gradually increasing source territories. Accepting that the source population even within an ethnically single phylum was a continuous chain of mutually incomprehensible vernaculars every 200 km, it is reasonable to propose linguistically motley sources of the migrants, who in new location for a millennium were forming a new lingua franca that finally reached us as a Classical Sanskrit. The surviving IE linguistic elements of the Classical Sanskrit are a sampling of the vernaculars involved in unidirectional migration.

The 800-word assembly of Turkisms and Türkic substrate in English includes frequent instances when the IE etymology cites Sanskrit to establish the IE origin of the English lexis. The body of the references to Sanskrit and Avesta attests that both were familiar with and to a degree amalgamated with the Türkic languages, even down to morphological elements. The status of Sanskrit, a product of the 2nd mill. BC, as some preeminent version of the PIE language, a vision dated to the 18th c., had long lapsed, but still lingers on in many IE etymological discourses.

An initial trial of the English-Sanskrit-Türkic lexical correspondences, using a preliminary listing of the English-Türkic correspondences, found a match for about 10% of the sample. Results of the initial trial indicated that arbitrarily allowing a conservative 10% for the remaining Sanskrit lexis, about 20% of the Sanskrit lexicon is retained or is based on the lexical base that originated with the Türkic linguistic family. Results gave some rough idea on the Sanskrit admixture, they were consistent with the assertion that only a fraction of the presumed European IE vocabulary reached the Indian subcontinent.

A consequent test using the 800-word list, found a match for 100 words, or 12% of the sample (Table 2a). The result of the expanded comparison agrees within reason with the result of the initial trial. The test covers apparent Turkisms in Sanskrit; it is a statistical indicator of the Türkic-Sanskrit lexis vs. the Türkic-English 800-word list. In respect to the Sanskrit lexis, it is a random lexical selection, since the content of Turkisms in English is independent from the content of Turkisms in Sanskrit, or of the Sanskrit loanwords in Türkic: 100 words of a random 800-word list. This number can't be applied to the whole body of the Romance in English, which is expected to be much diluted by various admixtures, and be smaller by an order of magnitude. In the context of the Türkic-Sanskrit-English commonality, the number 12% is significantly exaggerated by dubious Skt. cognates. It can be inferred that without Türkic words absorbed in the last two and a half millenniums, and the dubious cognates, the proportion of the Turkisms of the Classical Sanskrit origin would be reduced by half, to about 5% of a random sample. Allowing another 5-10% to cover the other Sanskrit Turkisms from the remaining portion of a typical 10,000-word Sanskrit vocabulary, a very rough prediction can be made that about 10% of the Classical Sanskrit lexis hails from the Türkic phylum. The numbers in English and Sanskrit are quite compatible, difference is but in the degree of admixture..

However, the number 10% is a very rough approximation, being dependent on the number of factors that influenced etymological sources. First and foremost is the poor appearance of Sanskrit cognates in the IE etymological exercises: the citing is spotty, only to the degree needed to establish connection consistent with the premise of the effort; just that flaw may underrepresent reality by a factor of 2 or 3, potentially raising proportion to 20-30%. Then there is a loose treatment of semantics, when Sanskritisms are cited inappropriately (Cf. janiṣ is cited both for “queen” and “wife”, “milk” is derived from “wipes off”, “bewail” equated with “call”, “Earth” is equated with thira which is an allophone of “terrain” and hails from the Türkic for “pasture”, etc.); that factor is falsely inflating the proportion. The presence of Sanskrit terms of Buddhist lexicon that grew on the Indian soil also inflates the proportion, since although they entered English as Turkisms, etymologically they originated with Sanskrit and were brought to English as the Türkic substrate at about the turn of the eras. These cases are few, but with the small sample of 100 words they need to be considered. Then there is a case of erroneous attribution to Turkisms in the body of the 800-word list; the questionable cases also inflate the proportion by as much as 1/3. Adjusted for these factors, the proportion of Turkisms in Sanskrit may be reasonably assessed as to be in the range of 10 to 15%. The presence of the basic vocabulary that could not have been introduced by the demographicly inferior Indo-Saka, Indo-Scythians, Huns, Kushans, Ephthalites, and later migrants, attests to a time depth of these loanwords ascending to the middle of the 2nd mill. BC, the time of the initial migration of the Indo-Aryan farmers to the South-Central Asia.

Table 2b. Türkic–Sanskrit–English lexical correspondences
  English Sanskrit, Avesta (Av.) Türkic English Sanskrit, Avesta (Av.) Türkic
1 act (v.) ajati “drives”, ajirah “moving, active” aqtar- (v.) ignite (v.) agnih “fire” yaq- (v.)
2 agile aja- “drive” ačïl itinerate (v.) e'ti “(he) goes”, Av. ae'iti ïd- (v.)
3 anger, anguish aihus, aihas, Av. azah- “need” özak (adj.) juice yus- “broth”
4 at (prep.) adhi “near” at- (v.) kin janati “begets, bears”, janah “race”, jatah “born” kin/kun/kün
5 aurora usah “dawn” yaruk lull (v.) lolati ulï- (v.)
6 axle aksah i:k mama matar- mamü
7 bake (v.) pakvah “cooked” bukač mantra mantra-s “sacred message or text, charm, spell, counsel” maŋra- (v.)
8 band (v., n.) bandhah ba- (v.) me (pron.) Skt., Av. mam min (pron.)
9 be (v.) bhavah “becoming”, bhavati “becomes, happens” buol- (v.) mead madhu “honey, honey drink, wine”, Av. maδu mir
10 bear (carry) bhárāmi ber- (v.) mental (adj.) matih “thought, mind” meŋtä (adj.)
11 bode (v.) bodhi bodi mickle mahat- “great”, mazah- “greatness”, Av. mazant- “great” mig
12 bow bhujati boq- (v.) milk marjati “wipes off” meme
13 bursary buddha sangha, bursaŋ mind matih “thought, mind” ming
14 call garhati “bewail, criticize” qol mist mih, megha “cloud, mist” muz
15 candle cand- “to give light, shine”, candra- “shining, glowing, moon” kandil mouse mus “mouse, rat” muš
16 cap kaput- “head” kap oat avasám “food” ot
17 case (box) karsha kečä ogle (v.) akshi “eye” ög- (v.)
18 case (instance) sad- “a falling” qaza other antarah “other, foreign” ötürü (adj.)
19 cave kupah “hollow, pit, cave” kovı: otter udrah, Av. udra ätär
20 chill (v., n.) hladate “refresh”, (pra)hladas “cooling down, enjoy” čil pot patra “bowl” patır
21 chintz chitra-s “clear, bright” (Hindi chint) čit purge (v.) pavate “purifies, cleanses”, putah “pure” pür- (v.)
22 cook (pa)kvah “cooked” kok- (v.) queen janiṣ “wife, woman”, gna “goddess”, Av. jainish “wife”, gǝna-, ɣǝna, ɣna, ǰaini “woman, wife” yeŋä
23 cow gaus coy regal (adj.) raj- “king, leader” arïɣ (adj.)
24 crust krud- “make hard, thicken” kairy sapphire sanipriya “dark precious stone” sepahir
25 curt (adj.) krdhuh qïrt (adj.) sari sati “garment, petticoat” sarïl (v.)
26 daddy tatah (Hindi dada) dedä sew (v.) sivyati “to sew” sač-
27 day dah “to burn” dün shade chattra “parasol” čadïn
28 dementia matih "thought," munih "sage, seer” dumur shake khaj “agitate, churn, stir” silk- (v.)
29 din dhuni “roar” tîŋ shock (v., n.) khaj “agitate, churn, stir” šok- (v.)
30 ea (OE) ap “water” aq- (v.) sinew snavah, Av. snavar “sinew” siŋir
31 Earth thira Yer sip (v.) sabar- “sap, milk, nectar” syp (v.)
32 eat (v.) atti ye- (v.) sit (v.) sidati “(he) sits” čıj- (v.)
33 enge (adj.) (OE) aihus, aihas, Av. azah- “need” özak (adj.) smile (v., n.) smayatē, smayati, smēras, smitas semeye (v.)
34 ewe avih eve son sunus song
35 eye akshi ög- (v.) stair stighnoti “mounts, rises, steps” šatu
36 far parah “farther, remote, ulterior” ıra:- susurrate (v.) svara- “sound, resound” šar šar (v., n., adj.)
37 fart pard burut- (v.) suture sutram “thread” sač
38 father pitar- ata terrain Skt. thira ter- (v.)
39 fire (v., n.) pu bur- this, that ta- šu (pron.)
40 first Skt. pura “at first, in the past”, Av. paro “in the past” bir tree dru “tree, wood” terek
41 fissure bhinadmi öz turf darbhah “bale of grass” ter- (v.)
42 foot pad-, Av. pad- but udder udhar ud
43 itinerate Skt. e'ti “(he) goes”, Av. ae'iti ïd- ululate (v.) lolati ulï- (v.)
44 gene janati “begets, bears”, janah “race”, janman- “birth, origin”, jatah “born”, janiṣ “wife, woman” ken- us (pronoun) nas, Av. na ös (pronoun)
45 genu janu, Av. znum yinčür- (v.) wake vajah (n.) “vigor”, vajayati (v.) vak
46 go (v.) gjihite “goes away” git was vasati var- (v.)
47 God huta- “invoked” kut wife janiṣ “wife, woman”, gna “goddess” ebi
48 gold hiranyam, Av. zaranya al(tun) wise veda “knowledge” vidya
49 herd sardhah kert wolf vrkah, wrkas, Av. wəhrko börü
50 I (arch. ic) ah(am) ič (es) young yuva yangi:
        yuck ye:k “demon” yek

The detectable presence of the Türkic substrate in Sanskrit points to the demographically mixed origin of the Indo-Aryan farmers, an idea that seems to have never crossed the minds of the IE linguists. At the dawn of the Bronze Age, and enduring into the 21st century, ethnicity and trades were connected very tightly. The leaders and rulers could have been of one ethno-linguistic group, the military base and its economy of another ethno-linguistic group, the Indo-Aryan Levites of a third group, farmers and their economy of a fourth group, and artisans still of another group. We know for whom the Hebrew Levites wrote their version of the history, but we may never know the native vernacular of the Indian Levites. The Sanskritic foundation may be a layered cake, misdated, misinterpreted, taken for a gospel, and exploited by patriots and politicians. The problems of the IE theory are rather systemic, precipitated and protracted by loyalty to the premises grown more on the notions of aged popular beliefs than on empiric experience. Started as a lump of raw dough, it was baked into a good-looking tangible and spongy loaf, tunneled through in all directions by incessant criticisms, and now stands as a fossilized crust, to be re-ground and re-baked with whatever flour it contains.

Germanic and Türkic Languages

There goes around a notion that Türkic-IE connection does not exist, that the IE could and was solely impacted only by the Ugro-Finnic group. In that scheme of geographical ethnography, the highly mobile Türkic people were confined to Altai, and Altai is too far from the European arena to possibly pass any borrowings into the IE languages. As a principle, alternate explanations are not considered, facts are viewed through a lens of preconception. This myth is solidly supported by a thorough disregard of linguistic reality. In contrast with the IE etymologies, most of the Türkic borrowings, or rather sharings, are so transparent, it takes a certified blind to pretend not seeing them with a naked eye. The etymology of the Türkic substrate in English practically does not exist, most of the Türkic words in English are left without any, even most flimsy, explanation. Etymological dictionaries and encyclopedias state with a straight face an “unknown origin”, or at best lead to OGk. or OLat., like if those were there on the first day of creation. Within the IE-centered enterprise, etymologies of “unknown origin” are routinely fossilized, for centuries recited between publications in defiance of scientific curiosity within the IE cocoon.

The reality is much simpler than it is popularly presented, and at the same time much more interesting.

Forrer (1894–1986) advocated that IE was composed of two unrelated languages (Forrer E., 1934, Neue Probleme zum Ursprung der indogermanichen Sprachen. Mannus”, B. 26).
 - Von den Velden, Friedrich, 1912, 1920 stipulated Uralo-Altaic origin of the Germanic substrate. At the time, the Uralo-Altaic was a pre-1950's concept.
 - Also, Feist, Sigmund (1865–1943), 1932, “The Origin of the Germanic Languages and the Europeanization of North Europe”. Language (Linguistic Society of America) 8 (4): pages 245–254. doi:10.2307/408831. http://jstor.org/stable/408831
 - Also, Uhlenbeck C.C., (1866–1951), 1957, suggested that IE was a mix of Ural-Altaic and Caucaso-Semitic type languages, The indogermanic mother language and mother tribes complex // American Anthropologist, Philadelphia, v. 39, no. 3, 385-393.
 - Also, John A. Hawkins (1990), Germanic Languages, in The Major Languages of Western Europe, Bernard Comrie, ed. (Routledge). ISBN 0-415-04738-2
 - Also, Edgar C. Polomé (1990), Types of Linguistic Evidence for Early Contact: Indo-Europeans and Non-Indo-Europeans. In: Markey-Greppin (eds.) When Worlds Collide 267-89.

The list is going on and on.

Notably, in reconstructive phonology, “The development of vowels in German languages shows a feature resembling the reconstructed development in Altaic: the reflexation of vowels (in particular the short ones) in stem-initial syllables strongly depends on the vowels of the following syllable(s)”, which allows reconstruction of Germanic phonology using Altaic parallels (A.V. Dybo, G.S. Starostin, 2008, In Defense of the Comparative Method, or The End of the Vovin Controversy//Aspects of Comparative Linguistics 3, p.139, Moscow, RSUH). One has to climb a Türkic ledge to reconstruct the Pra-Germanic phonology. Is not this an ultimate insult to the IE pedigree and philologists who were solemnly lodged on an island, unaware that they are perched on a fringe of the Eurasian peninsula. Like in the geocentric model, Eurasia rotates around Alps.

The Family Tree model keeps inflicting serious damage. With the Family Tree blinds on, any aberration either creates a mental block that leaves it alone, or engages an overdrive not unlike the universally supported science on the Aristotle's geocentric model with its ever growing entangling of the embedded epicycles before the appearance of the heliocentric model in the course of scientific revolution. The phonetics, for example, instead of surveying a list of possible phonetical kins in search of corroborating evidence, keeps developing epicycles to fit into the traditional creationist model the widely acknowledged phenomena of the Germanic and Türkic phonetical parallels. It was not the door that was tightly closed, it was the model that spurns the open doors. Opening the door would not only advance the English and Germanic linguistics, but would also illuminate the non-IE contributors to the IE line. A good example of the fruitfulness of such studies is the Bulgar lexicon extracted from Hungarian; it meshes up with other Türkic vernaculars, enriching not only Hungarian and Türkic philology, but also going far beyond into the IE and Oriental studies not confined to linguistics alone. Such single-dimensional self-impairment breeds primitive concoctions compatible with miraculous stories that produced simplistic models parodying intricacy and magnificence of the actual evolution.

Aside from the Old French impact, in English is visible a blending of three unrelated languages, and some attentive eye would detect that the historical development of English differs from the bulk of the Germanic languages, pointing to a separate, yet related, source. Sufficient indicators point to languages with their own version of lexicon and morphology. The main gap that separates English from German is filled with French borrowings that introduced Latin-derived lexis. The French borrowings, however, have a superficial nature, they did not impact too much the old Old English daily vocabulary. In contrast, unlike English and other Germanic languages German was not affected by the French borrowings.

The commonalities between English and Germanic languages are enduring, for example the difference between the front rounded morpheme u (muse, ü) and back morpheme u (cook, u) are consistently retained going all the way to the pra-pra-language. Graphically, they are frequently camouflaged by the attempts of various writers to present the front rounded u distinctly from the back u: you, iou, ui, ull vs. u, ou, oo, oue, ul; same with the front i vs back i: e, ea, ee, ei, ie vs. i, y. In the pra-pra-language the difference was semantic and critical, carried on as far as it was needed, but the creolized English mostly lost that semantic function, as well as the ancient suffixes, and the retention of the once critical differences is purely inertial. Still, the abundance of spellings for the phoneme i, for example, is quite telling, at the dawn of Romanization the differences were significant enough to warrant explicit coding e, ea, ee, ei, ie, in addition to the laconic i, with an emphasis on the semantically distinct special quality of the rendered phoneme. Other languages, encountering the same problem with the front i vs back i, similarly came up with local inventions, like the Cyrillic è vs. û or ï. Prima facie distinct, these are paradigmatic solutions to a paradigmatic transfer problem. The imprint of the substrate phonemics still lives in the daughter languages.

An unintended consequence of the search for the origin of the English's substrate is a review of the state of the purported IE etymology for the English Turkisms, where there is one. Except for the candid “of unknown origin” and “no known cognates beyond Germanic (and at times Celtic)”, the rest is marked by systemic dishonesty, artificiality, or open flimsiness. Where the IE etymology of Turkisms appeared to be plausible, a doubt was interpreted in favor of the IE version, and such words were not credited as Turkisms. Some trends, like appealing to Lat. prefixes to chop down an original root, may even be formulated as some IE linguistic law: all original stems starting with e-, for example, can be wiped out with a single device of “assimilated ex-” - “out”, even in cases when a cognate family clearly attest that e- is a part of the root. The appeal to “onomatopoeic” or “echoic” origin is a routine antic, never supported by a survey of the “echoic” forms across linguistic families. Flimsiness or sleight of hand reigns in not citing inconvenient cognates not only from the eastern languages like Kor. or Mong., but even from within the IE languages like the Sl. In some cases, falsification borders on ridiculous, see mare for example. While cognates of mare predominate across Eurasia, the IE etymology boldly declares that there are “no known cognates beyond Germanic and Celtic”. Criticisms and disputes on the IE etymological methodology are as old as the IE etymology itself, and the present review of the substrate layer presents abundant illustrations of its shortcomings.

A majority of the Turkisms belongs exclusively to a class of the Germanic languages, with a sprinkle of random guests into selected neighboring languages. Etymological assessments glide over that exclusivity without specifically noting it and any systemic analysis. Result is a loss of the historical connections, the very objective of the etymological history. With the tool of the faux “reconstructions”, a word is doomed to be sunk into the netherworld of the PIE smoke pool. It is an image of a Tree of Life with apples and watermelons hanging off the same branch. It is an image of a special pleading PIE Babylon Tower and miraculous creation and scattering for a peculiar group of languages. Besides the class of the Germanic languages, there is a distinct class of the Germanic-Slavic strata. While the etymological historicity of the Norman-Anglosaxon blend is perfectly transparent, the Germanic-Slavic blend is blurred in the smoke pool as Germanic-to-Slavic loanwords, albeit a good slice if not a majority of it is rooted in the omitted shared Turkisms. Each side is cooking their own historical broth within proprietary confines of the etheric “proto-language”. Under a guise of loanwords, history is stripped off the historicity, turning its linguistic aspect into a mystic unlit forest.

Scandinavia has a distinct and uncluttered genetic footprint, with salient haplogroups I1, R1b, and R1a. The haplogroups I1 and R1a came from the continent, they probably were already amalgamated and shared a common “Old Europe” language. The haplogroup R1b came as a nomadic wave of Kurgans from the Eastern Europe, they belonged to the Kurgan Pit Grave archeological culture, liable to speak their own distinct Kurgan language. At about 3700 ybp (1700 BC), the newcomer Haplogroup I1 supplanted and joined the genetically related older 8000 ybp-old (6000 BC) Scandinavian haplogroups I2 and I. The haplogroup I1, previously widely scattered across Europe, found a refuge in Scandinavia. The other refuge area was the forested northern area of the Eastern Europe with the Corded Ware Sprachbunds. The Scandinavian blend of I1, I2, I, and R1a makes a good candidate for the Indo-European Scandinavian Sprachbund that later expanded back to the continent. The haplogroup R1b, nowadays spread in the Scandinavia almost equally with the leading haplogroup I1, appeared in the continental Europe 4800-4500 ybp (2800-2500 BC). That makes a cause for the European mix of I1 and R1a, with the newcomer R1b, to come to Scandinavia safe heaven as a compact, in a single movement that could last for centuries, akin to the directional movement of the Bulgar and Slav compact to the Danube Bulgaria. Demographically, the R1b nomadic Kurgans were a minority, the I1 and R1a farmers were a majority. The nomadic Kurgans were the military and ruling force, farmers were the manpower and economic base. These first Kurgans in Scandinavia should not be confused with the later Kurgans around the New Era. 1500 years later, by the Sarmat time, the Scandinavian blend of the I1, R1a, and R1b had probably amalgamated, with only traces of the former statutory stratification.

By the turn of the eras, they formed a distinct ethnicity named Germanic, we have a good idea of the early Germanic languages, and of the dialects within individual languages. At the base of the Germanic languages lay the surviving languages of the “Old Europe”, enriched by a heavy admixture of traceable Turkisms. From the Scandinavia of the 2600-2300 ybp (600-300 BC), Germanic people and languages rolled down to the Alps and Black Sea, to the Gaul in the west, and Vistula in the east, reconstituting the “Old Europe” languages in their newer IE hypostasis, and leaving Pomerania with its predominately R1a population an isolated island. The languages of the Scandinavian migrants to the continent constituted a single Germanic phylum, mutually incomprehensible neither with the continental “Old Europe” languages, nor with the European Türkic vernaculars.

With the arrival of the haplogroup R1b, connected with the Celtic Beaker culture spreading from the west, and the Pit Grave culture spreading from the east, nearly entire male farming population in the continental Europe marked by the haplogroups I1, I2, E1b, and R1a had waned, leaving there only the haplogroup R1b for an auspicious period of 4500-3500 ybp (2500-1500 BC). The old female European population endured and even increased. If languages are anchored by women, after 3500 ybp (1500 BC) the “Old European” fraction had to increase, at the expense of the prior Türkic fraction. Return of the Scandinavian-Germanic and Corded Ware refugees repopulating Europe after 3500 ybp (1500 BC) reseeded Europe with the male haplogroups I1, I2, E1b, and R1a, and with the relict reflexes of the “Old Europe” languages.

The Scandinavian languages and haplogroups provide a lab-quality experimental data on the correlation of binary genetic/linguistic components prior to the Scandinavian-Germanic expansion to the continent. Presently, Sweden has R1b 22%, with I1 37% and R1a 16%; Norway - R1b 32%, with I1 32% and R1a 26%; Denmark - R1b 33%, with I1 34% and R1a 15%. The kindreded haplogroup I2, fairly well correlated with the IE Slavic populations and ubiquitous in the Old Europe, is currently present only in trace numbers, Sweden 1.5%, Norway 0%, and Denmark 2%. The R1b/(I1+R1a) numbers serve as a ballpark predictors of the Türkic/IE linguistic admixture, about 20-30% of the Scandinavian lexis. The proportion of Turkisms, in agreement with the clinal distribution of R1b as a marker of originally Türkic phylum, has to reduce from the European North to the Mediterranean South, while allowing local spurts to extend at any distance in any direction. The older R1a, sedentary in Europe and nomadic in Asia, the “Old Europe” in the European west and Türkic in the Eurasian east, would play a secondary swing role, depending on the time and source of its presence.

There is no sane explanation on the origin of the IE component in the Scandinavian languages other than the people marked by the haplogroup subclades I1 and R1a, and in the Slavic languages other than the people marked by the haplogroup subclade I2. That leads to the Old Europe and its languages, a de-facto IE lexical majority in the Scandinavian languages, and de-facto property of the IE Slavic languages. The opposite assertion, advanced by the investigators of the Pit Grave kurgans' genetics, on the IE R1b and the Türkic I1 and R1a, leads to a nonsense, to the opposite, a Türkic component majority and IE component minority in the Scandinavian languages, and a pile of other corollary absurdity. The Scandinavian linguistic lab provides incontestable evidence in favor of the IE – I1 and R1a correlation.

This non-thought experiment in the Scandinavian lab identifies the Old Europe of the 8000-4500 ybp (6000-1500 BC) as a transitory station on the IE path, a time of creeping agriculture and a demise of the hunter-gatherer economies. The period of 4500-3500 ybp (2500-1500 BC) is a transformative period, connected with massive relocations, realignments, and amalgamations. For a thousand years, or for 40 generations, the fractioned “Old Europe” male fugitives procreated with the local females at dispersed refuges, grew up with their languages, and were forming amalgamated IE Sprachbunds. Their return and dispersion 40 generations later was no less eventful. The relatively late in the reverse flow migrations, the first historical records from the first literate sources relay the stories of the Dorian invasion, Achaean invasion, Hittite invasion, Aryan invasion, wars of conquest, etc., emanating from the N. Pontic area, with no less destructive consequences than the Kurgan invasion of Europe in the 4th-3rd mill. BC. In the 1st mill. BC episodes of relocations, realignments, and amalgamations lay the roots of the Anglo-Saxon origins, the origins of numerous Germanic tribes connected with the history of England and English language, and with the history of the IE layer in their languages.

It is highly unlikely that a scientific luck will ever encounter a situation duplicate of the massive Scandinavian case, where a historical prism would separate long past events into clearly readable distinct beams of reproducible genetic and linguistic pattern. It is especially fortunate that the Scandinavian lab is a test bed for the very IE family, the scholarship, historiography, enigma, and speculations of which easily exceed those of all other linguistic families combined. At the same time, the improved methods of genetic analysis, improvements in the accuracy of the genetic dating, and philological advances may allow analyses of much less clear-cut situations.

Figure 4. Neighbor-joining tree of European, Turkic central Asian and Turkish (Anatolian) populations constructed from HVS I sequences
(Hatice Mergen et al., Mitochondrial DNA sequence variation in the Anatolian Peninsula (Turkey)//Journal of Genetics, Vol. 83, No. 1, April 2004)

In 2004 came out the Hatice Mergen et al., Mitochondrial DNA sequence variation in the Anatolian Peninsula (Turkey) that surprised the world with the results of genetic connection between the British and Turkish mamas. The linguistics, which ordinarily is at a loss with the genetics, unwittingly got a shot in the arm: the genetically closest kins of the Turkish mamas turned out to be the gelding-riding nomadic Central Asian Türkic mamas (no surprise there) and the British mamas (what a surprise!). And where the mama goes, there goes the language [Hatice Mergen, Department of Molecular Biology, Science Faculty, Hacettepe University, 06532 Beytepe, Ankara, Turkey. E-mail: mergen@hacettepe.edu.tr].

The male Y-DNA R1a and R1b, ubiquitous among the Türkic papas, do not hurt either, they mark genetic protraction between the Türkic peoples and the populations of the European countries.

Latvian is held to be the most archaic language in the N. Europe, and accordingly it is cited as closest to the pre-IE languages of the N. Europe. It also happens to be the only N. European language that was examined for substrate languages, and viola, one of them turned out to be Türkic. The substrate lexicon, morphology, syntax, phonetics of vowels and consonants, even the agglutinative suffixes, all find their roots in Türkic, and they are yet quite compatible with the modern Turkish, although the Turkish belongs to the Oguz branch, and the Latvian demonstrates features and has historical links pointing to the Ogur branch. The difference must be on the range of Hittite vs. modern Italian, and still the volume of evidence is more than overwhelming to demonstrate intimate genetic connection. A close look at Latvian also allowed discerning that its substrate Türkic lexis substantially differs from the Türkic substrate lexis of the Slavic languages. Latvian and Slavic use numerous different synonyms in their languages, evidently coming from different Türkic phyla. And if the Latvian is archaic, what about its substrate?

Close to the 3rd millennium BC, at about 3500 BC, Europe was invaded by the Kurgan wave 2, and soon after that, at about 3000 BC, came the Kurgan wave 3. These Neolithic waves, archeologically associated with the cultures dubbed Battle Ax and Corded Ware, first migrated to the right banks of the river Dnieper, and then farther on to the Central Europe. How numerous was the part of the ancient Türkic tribes and which Türkic tribes passed through the Dnieper area is a moot matter. Most of them amalgamated with the local aborigines, that is amply attested. A part of them moved to the Central Europe, roughly coinciding in time with the Celtic expansion to the Central Europe, wrecking there a havoc known as the Central European “killing fields”. Both invader flows were predominantly marked by R1b haplogroup, the survivals marked by an alphabetical soup of male haplogroups fled to the E. Europe. Of the invaders, only two Türkic tribes, that of the ancient Bulgars and Akathirs, retained their ethnic identity; the Celts were too far remote from their roots, they received a non-ethnic moniker Kelty (y as i in sit), meaning plural “newcomers”. The female-perpetuated creole languages, augmented by diverse admixtures, became the Celtic languages that reached the literate times. The Bulgar-descendent tribes of the Tatars, Chuvashes, and Balkars in their languages have preserved some idiosyncrasies that better resonate with the European Latin, Germanic, and English languages; of all the Türkic languages, some cognate lexemes are attested exclusively in the Chuvash. Many Latin Turkisms can be alternatively attributed to the Celtic or Türkic influence. The ancient Greek Turkisms differ from those of the Italics, they did not have the Celtic influence. The Greek and Latin Turkisms, at times also attested in India, are now classed as innate PIE vocabulary.

The analytical work performed in 2015 determined genetic composition of two archeological cultures, Kurgan Pit Grave and Corded Ware, the first called in Russian Yamnaya, from yama “pit”. The Kurgan folks were predominantly marked by Y-DNA haplogroup subclade R1b-M269 on the male side and mtDNA haplogroup subclades western Eurasian U2e, U5a, T and eastern Eurasian C and A10 on the female side; migrants moved by whole families. The Corded Ware folks were marked by Y-DNA R1a, E-V13, G2a, I2a, I2b, I1, N1c1 on the male side and mtDNA H, K, U5 on the female side. According to M.E. Allentoft et al., 2015, Pit Grave Kurganians were nearly exclusively R1b, Corded Ware were a mix of 3/4 Kurganians and 1/4 locals of the above Y-DNA haplogroups. The steppe migrants replaced ~3/4 of the central Europeans' ancestry. Before 3000 BC the native genomes resembled those of early farmers from the Middle East (G2a) and even earlier European hunter-gatherers (I2b, I1). By 2000 BC the native genomes were more like those of the Pit Grave people. The steppe ancestry persisted in all sampled central Europeans from about 4,500 to about 3,000 ybp. It can be safely submitted that on the average, the local Sprachbunds amalgamated in the same proportion, the Türkic phylum predominated. Migrations were followed by resurgences of the previous inhabitants between the Late Neolithic and present, the farmer and hunter-gatherer lineages rose again when the Bell Beaker and Unetice groups reduced the Pit Grave lineage to the comparable levels of the present-day Europeans. That had to raise accordingly the proportion of the non-Türkic linguistic component. These results elucidate the spread of kurgan burials in the central and western Europe, and provide support for the concept of the steppe origin of the Türkic substrate of the European Indo-European languages (M.E. Allentoft et al., 2015, W.Haak et al., 2015).

Perusal the body of the words of the “possibly pre-IE Mediterranean language”, of the “uncertain origin” words, and a mass of dubious IE and Germanic *asterisked conjectures, gives an impression that most of the “uncertain origin” IE *reconstructions are simply fancifully (or unskillfully, or primitively) slightly distorted Türkic stems and derivatives. On the other hand, excepting conjectures that are too imaginative or semantically too far afield, the IE conjectures do end up resembling their Türkic siblings: a round peg forced through a square hole comes out squarish. A statistical value of the *reconstructions is nil: they are not independently calibrated, are not corroborated with later discoveries, and their only utility is to remold the past. In most cases, a basic understanding of the morphology of the Türkic languages would greatly alleviate the scholarly puzzlement, too often linguists confuse suffixes with parts of the stem, and stem parts with prefixes. Filtering English vocabulary on the “of unknown origin” and the like readily supplies a listing of substrate candidates, and the base words of 2000 years-old life make it easy to locate the Türkic substrate originals with close phonetics and exact or nearly exact semantics.

The concept of Nostratic was formulated in the early 20th c. as an Ursprache pra-language Family Tree model that originated most of the Eurasian languages, from IE to Sino-Caucasian. In its most radical interpretation, the Nostratic concept is reduced to a mega-IE axis. The concept was formulated exclusively on studying the linguistic cognates found in very diverse languages, on the continental geographical scale, and with the implied Family Tree model. It was formulated before the development of the linguistic Wave model, knowledge of the Kurgan waves, understanding of the continental-scale migrations, and methods of genetic tracing and dating. In conceptual dating, and under the Family Tree model, the Nostratic concept envisioned for the Ursprache the times as remote as 30-20,000 ybp. Correcting for the later developments, the Nostratic idea contracts to the linguistic layer disseminated across Eurasia by the Kurgan waves during pre-Kurgan (R1a and R1b haplogroups) and Kurgan periods (R1b haplogroup) that started in the Neolithic and continued till and including the Middle Ages. Some pre-Kurgan Nostratic spread (R1a and R1b haplogroups) started around 10,000 BC, it intensified around 6,000 BC, greatly intensified around 4,000 BC, and reached India and China during 3,000-2,000 BC. The effect of refugees from the Kurgan waves constitutes an innate component of the Kurgan waves' phenomena, with its own linguistic and cultural propagation.


Typologically, Türkic languages are SOV-type agglutinative, genderless, they feature sound harmony, exclusively suffixing word formation, dependents precede their head, and they form numerous nonfinite verb constructions. Nominal stems take on nominal suffixes, and verbal stems take on verbal suffixes. Most stems are polysemantic, with a trail of largely overlapping meanings related to the main notion, with figurative extensions tenuously connected to the main notion, like “break” with figurative “weak” and “melt”. A common development is the use of a single root to create contrasting (e.g. guest vs. host), complimentary (e.g. tick, tack, tuck), and metaphorical (e.g. give, open) notions, it allows great expansion with economical means. Languages form semantic expressions by adding suffixes to the root stem. Like in English, the adjectives are morphologically not clearly distinguished from nouns. The expressive verbal morphology is rich with markers of actionality, possibility, negation, voice, aspect, mood, tense, person, interrogation, attitude and more. Just the Turkish alone, accounting for the vowel harmony alternations, theoretically has over a thousand of grammatical morphemes. Unlike English, which has a canonized list of all English words with a defined total count, the Türkic words can't be counted because new forms are continuously created using a conventional set of suffixes and a conventional practice of their usage, learned with the language and intuitively applied. Theoretically, the Türkic lexis is finite, but nobody knows what is that theoretical number.

On the other hand, the quantity of the Türkic stems is not large, they comfortably fit into single volume linguistic dictionaries. Prefixes are structurally counter indicated, they impact the root and baffle the crisp mechanism of semantic content. Instead of prefixes are used prepositions, audibly identical to prefixes. The accent on the last syllable leaves the root portion undisturbed in any environment, while internalization in English moves the accent to the first syllable. These simple structural conventions secure the distinct endurance of the root system. Words are formed starting with a single morpheme (of which Eng. preserved just one, ö, the word awe), first by agglutinating distinguishing functional morphemes one at a time, then by agglutinating semantical morpheme units (like -lig- “like”), ending up with verbal conjugational morpheme units. Generally, the logical structure of the language allows to trace a word through a linked chain of derivatives to its source, that is a monumental difference between the Türkic and the IE languages with their haphazard mix of agglutinative and random (borrowings and amalgamation) word-forming practice that can't be called “procedure” with the absence of such. Unlike the IE word-forming practice, the Türkic practice is usually predictable in both directions, with the loanwords standing out; that makes the science of Turkologists incompatibly easier than the tortured science of the Indoeuropeistics.

A common feature of the base roots is their two-dimensional semantics, one immediate, and one implied; both sememes may develop derivatives, with the immediate derivative carrying over the implied meaning (like “do something” vs. “do something together”), and the implied derivatives directly continuing implicated sememe but indirectly implying the base meaning (like “together (do something)”). This bifurcated semantics may puzzle a student who sees only a direct meaning, and erroneously concludes that there is no connection between a base root and its derivative; or the connection may appear to be too far-fetched to be credible; while an understanding of the implied notion puts connection in proper perspective (like “stopover” also implies “bunching together”, and thus “tribe”, “crowd”, “companions”, and the like). Frequently, the subject of the implied semantics is not included in a phrase, and a translator of the phrase puts the crucial implied subject in parentheses for the phrase to make sense. The two-dimensional semantics first appears in the linguistic relicts, the single-syllable stems, and continues undiminished well into the era of technological revolutions marked by lexical expansion into construction, metallurgy, and producing pastoralism, when grammatical developments supplanted notional expressions with morphological means to form precise semantic variety by an assortment of grammatical modifications. The use of agglutinative method (Cf. -iš as cooperative mood marker) made unnecessary the use of implied semantics (Cf. “together” as a derivative of “bunch together”, an implied notion of the word “stopover”). At times, from the distance of ages, the base notion may retreat to a background blanketed by its derivatives; that may result in misguided etymology (Cf. šarp, Eng. “sharp”, with a base notion of “steep incline” essential in the mountainous pasturing, it can be used and interpreted as “rough”, “inaccessible”, “difficult”, “tapered”, even “acerbic (taste)”, etc., with the physical base notion on the plains' pasturing atrophied). In a single-dimension linguistic world, the two-dimensionality of the often metaphorical notional semantics may be an impediment. A tree appears as a bush, with stalk obscured by the crown.

A common phonetic feature of the Türkic languages is a somewhat wild, somewhat organized alternation mirrored, for example, by the strongly pronounced Germanic d/z alternation shared with the Türkic family. The most prominent of them are the m/b and z/d/y alternations. Phonetic alternations from the beginning were probably a consequence of amalgamations, and of consecutive amalgamations thereafter. The m/b bifurcation is most telling, the peculiar coexistence of the m/b alternation within the same communities endured for 7 millennia and has survived into the present times, occasionally within a single family the alternation is not registered by the speakers or listeners themselves. Traces of the m/b divide extend far beyond the Türkic milieu, extending across the eastern and western parts of the Eurasia and crossing barriers between linguistic families, with a heavy imprint within the entire IE family. The oldest traces of the m/b alternations are visible in Sumerian, they are also a trait in the Finno-Ugric family including Hungarian. Sumers are believed to have migrated to Mesopotamia between ca 5500 and 4000 BC from the area between Caspian Sea and Hindu Kush and Kopet Dag mountains, about the area of the modern Turkmenistan. They carried an R1b-M269 subclade haplogroup marker (Klyosov A., 2012, p. 87on). The genetical and archeological datings are largely consistent. Suggestions on a temporal priority of one phoneme over the other are unsupportable. The Sumer migrants were but an offshoot of the R1b migration flow, a sidestream of the much greater demographical situation marked by numerous other R1b subclades.

The English, via Anglo-Saxon, lost most of the Türkic suffixes, and added some to its morphology. The modern Turkish retained most of the suffixes, reactivated some that were out of use, and simplified orthography, collecting some phonetically close phonemes under a common symbol. The following listing of the Turkish suffixes allows to highlight modern English suffixes genetically connected with the modern Turkish suffixes; as a body, they constitute a stand-alone paradigm that inescapably attest to the common genesis, a paradigmatic transfer evidence in concert with all other systemic cases of paradigmatic transfer. One of the most significant Türkic suffixes is –t that forms deverbal nouns. The final –t in Anglo-Saxon and English nouns is a good pointer to locate the underlying verbal stems for the nominal derivatives, Cf. Türkic bert “tax, return” (ber- “to give”), ölüt “'killing” (ölür- “kill”), bütüt “completion” (bütür- “finish, complete”); English port “harbor” (bar- “depart”), unit “part of” (una- “agree”), gift “present” (kiv- “give”). Since the suffix –t in English lost its original function, it became an indelible part of the root: port, unit, gift.

The modern Turkish language is a product of amalgamation of numerous Türkic languages, which in varying degree contributed to the modern Turkish language. The Turkey's ethnic map discriminates 25 constituent ethnicities on top of the titular Oguz component, a number of those ethnicities, including within the Oguz fraction, before the amalgamation had distinct languages up to mutually incomprehensible. As a result, Turkish is but one of the Türkic languages with its own peculiar lexis, removed fairly far from many other sibling Türkic languages and likewise from the OT variegated collection, and it can't be used as a weighted instance of the Türkic phylum. According to some estimates, Turkish is a blend of about 86% Türkic lexis and 14% of admixture lexis; it is fairly far removed from the formalized OT sampling, and its daily lexis is variegated much above the 14%. A careful examination of the lexicons of 42+ Türkic languages (the vague number reflects the current state of affairs, not the actual number) would likely produce forms that are genetically nearer to the historical and current English forms. Comparison of the English and Türkic lexicons is largely limited to the root forms, with not infrequent cases of the Türkic suffixes embedded into the English word stem. English was already heavily creolized when the Anglo-Saxons brought it over to the Albion, but the traces of its substrate are still fossilized in the lexis and morphology, and the agglutinated suffixation is an ingrained and burgeoning part of its morphology.

In addition to exclusively suffixing word formation, Türkic languages widely use prepositions that are indistinguishable from prefixes neither phonetically nor functionally. Türkic languages also widely use postpositions that modify conjugated or declined forms. Written separately, phonetically they are indistinguishable from the prefixes and suffixes. The conditional status of the prepositions and postpositions adds caveats to the doctrinal assertion of the absolute absence of prefixes in the Türkic languages. Some of them made their way to English, retaining their semantic function in a new syntactic environment. These cases belong to the body of paradigmatic transfer evidence.

Grammatically genderless for inanimate objects, or where gender is irrelevant, or where gender is clear from the context, Türkic languages may use suffixes -čïn, -kčïn and -lai to mark female gender; otherwise gender, like the age and other traits, is indicated by determinants. In that, Türkic and English are identical, the other Germanic languages are not too far off with their haphazard treatment of gender. In all cases, gender markers are borrowings from neighboring languages; the Türkic gender markers are borrowings from Mongolian and used by the neighboring eastern Türkic languages.

The IE etymology tends to follow a path of deriving verbs from nouns. The Türkic linguistic tendency is the opposite, the prime stem is mostly verbal, the noun semantic is a derivative. The difference is of a cardinal nature for understanding the IE lexicon of the Türkic origin. Among the Indo-European languages the Türkic verbal-nominal homonymy, the system of roots in Türkic languages where the verbal root also serves as a noun, in a most developed form is represented in English, but albeit in an incomplete form is also known in some other European languages (Sevortyan, 1974, 39). The English and Türkic innately share this morphological feature. English has stand (v.) and stand (n., adj.), sit (v.) and seat (n.), sleep (v.) and sleep (n.), with uncounted others, and continues to productively use this inherited Türkic word-forming vehicle. In the Türkic languages, the productivity of the verbal-nominal homonymy faded by the Middle Ages, but in English this ancient Türkic linguistic backbone experienced its Renaissance before the 13th c., and still keeps developing, illustrating a spiral conversion trend in the linguistic processes.

The IE linguistic paradigm is built on attested lexicon, non-attested lexicon, and phonology to explain the linguistic past seen from the perch of the viewer, with little or none attention paid to morphology or syntax. The PIE dictionaries are widely available, they inventory the attested lexicons and suggest non-attested *reconstructed forms, ostensibly in their stripped state, without morphological modifiers, but in fact frequently confusing unperceived suffixes with parts of the stems. The modifiers are structured individually by IE linguistic branch, i.e. Proto-Germanic, Proto-Slavic, etc., but they did not come from the same tree, and even conceptually are not compatible across the IE family. Accordingly, there are no PIE morphological dictionaries, nor a PIE morphological inventory. Such a pathetic state, after generations of linguists spent gargantuan efforts on PIE, has a good reason, reflected in the competing models of the linguistic development: the absence of any trace of commonality.

In the usage frequency statistics, morphological elements are buried with the words, making it impossible to assess the extent and the role morphological elements of different provenance play in forming a language. A cursory qualitative assessment readily shows importance of some morphemes: the listing of words beginning with negation element un-, an allophone of the Türkic negation element aŋ/an, far exceeds the number of words with stems starting with un- (undue “not due” vs. unary “single”): of the approximately 320 words beginning with una-, only 5 stems start so; in this case the amplification factor of the single morphological element un- thus exceeds a factor of 60, it enormously enriches English vocabulary, empowers its expressiveness, and enables stinting communication, all without a hint of a due statistical credit.

The overwhelming majority of the Türkic vocabulary is produced internally, from the basic, mostly verbal, stems of the language. Signally, some Old Türkic words bear unmistaken indicators of being borrowings: they are stand-alone entries without extensive nest of derivatives and without a transparent base stem. This class has few words smacking of the European extraction; the path from the Old Europe to the Old Türkic, which is a collection of exclusively eastern lexicons with a shade of the western languages, is unclear. These borrowings are a class of the words separate from the class of the religious borrowings from the Buddhist (Skt., Prakit, Hindi) and Islamic (Arabic, Persian) lingo.

In Türkic linguistic family numbering 42+ languages, stem defines notion, and grammatical constructs are formed by agglutinating suffixes specific for parts of speech. With some exceptions, English lost many suffixes at some pra-English stage; thus the Türkic generic stems became English verbs and nouns without functional markers; in English, they are used at random as verbs, nouns, and verbal or noun adjectives, and then they develop various verbal and noun derivatives. Both in English and Türkic, the suffixation method is extremely productive, it allows to create the same notions with different nuances from a variety of unrelated stems, like cloudy, dimly, murky, shady in English and 20+ forms for greedy in Türkic, greatly enriching the language. In Türkic, the verbal form of the stem is predominant in forming nouns and other derivatives, which in turn may develop, in a spiral fashion, verbs from the derivative nouns and adjectives. With time, some Türkic suffixes waned from active use, remaining active relicts, while the replacement suffixes rose; parallel usage of relict and active suffixes is normal, with different forms at times acquiring differing semantics. In some cases, the English substrate and loanwords stick out because they carry imbedded vestiges of the long-gone Türkic suffixes.

In any language, lexis alone does not make a language. A collection of structural grouping of sound bits aligned into into morphemes and words that constitutes lexis is no different from a pile of components in a scrap pit, be it a ruined cathedral or electronic refuse stripped of precious metals, that is an unidimensional pile. A second dimension is a structure, it made the cathedral before its ruin and a computer before salvaging, it makes a language. The structure of a language makes it a systematic means of communication to convey meaning. At the sound level, closely related dialects may differ wildly one from each other while at the structure level the same wildly differing dialects would be nearly identical. In the accepted classification typology with linear order of precedence, the Türkic and Germanic languages are classed as SOV-type, while English lost its initial SOV structure, and replaced it with a rigid SVO-type, obliterating a significant former structural commonality between English and the domain of the Germanic group. Of the complexity of the Germanic group SOV languages, represented by Dutch, German, and Frisian languages, with their main clauses SVO and embedded clauses SOV, English retained the SVO order of the main clauses. English shares genderless typology with the West Frisian and Türkic, and shares indiscriminate treatment of female gender with the Scandinavian, Frisian, and Türkic. The absence of grammatical gender indicator is not the same as the absence of the gender, it is the absence of the load of the grammatical gender synchronization between parts of speech. The gender is conveyed by other means without any synchronization effort (Cf. Türkic prepositions er, kïz “man, girl” vs. Eng. she-), and only when it is relevant. As a group, Germanic languages have never digested completely the alien gender grammar. Of the Germanic languages, even though the links are present, German is distinguished by weak and erratic link between declension and gender, gender and sex. Agglutination within the Germanic group varies from minimal to heavy. The erratic typology within the Germanic group is consistent with the substrate/adstrate concept.

If a presence of systemic grammatical gender was used in the definition of the IE family, the size of the family would be greatly diminished.

It is generally believed, with a support of historical examples, that a phenomenon of a widespread multilingualism (bilingualism in a minimal case) is needed to affect a structural change of the language. Under that premise, a linguistic structural change is a manifestation of the multilingualism, it is an effect of amalgamation. History of English allows to discern interference patterns of recurring amalgamations, with Anglo-Saxon carrying a major doze of Türkic lexical and morphological inheritance, Turkisms from other sources (primarily Frisian) overlay the resultant blend with the Anglo-Saxon component, the Romance layer carried over another load of Turkisms, and each one either directly impacted the structure of the English, or created conditions for a needed change. In such dynamic situation no conclusions are obtainable with static methods of discrete universe for each conglutinate language, internal reconstruction, lexicostatistics, or glottochronology, or they are easily vitiated when drawn with such methods.

A syntactical feature shared by the English and Türkic is the wide use of paired words - idioms, frequently used as compound words: horsetail and horse tail, bluegrass and blue grass, etc. Some words have dozens of such pairs, in English blackback, blackball, Blackbeard, etc., in Türkic qara ačı, qara baš - qarabaš, qara boɣuq, etc. Phonetically, the compounds and paired words are indistinguishable, the spelling is purely conventional. And all languages have polysemantic words; English and Türkic lexicons contain thousands of them; and miraculously, frequently the specific polysemantic meanings survived from the substrate Türkic into the modern English: yer/earth are both land and dirt, tili/talk are both a communication process and a story, etc. This mechanical transfer of polysemantic word from a language to language complete with the word's polysemantic meanings is a main indicator of the genetic connection, in contrast with the loanwords, which are borrowed with a specific semantic meaning when a receiving language adopts a word with only a meaning it needs to fill in. Below are cited numerous examples of polysemantic meanings carried over to English.

An univocal syntactical feature shared by English and Türkic is the use of predicate nouns with marker of belonging instead of accusative case: in Türkic “my uncle” with agglutinated suffix, in English “my uncle” with a pronoun instead of an article; thus “father loves her”, but “she loves her father”, and not “she loves father” or “she loves the father” ~ “ella ama a su padre” (Romance), but “she loves father” ~ “îíà ëþáèò îòöà” (Slavic). In Türkic example Huar-Ases (Suar-Ases) > Chorasm > Horezm “(land) of River People”; there -m is the Türkic possessive marker, a forerunner of the English “my”.

Another notable univocal feature that until 19th c. was shared by the English and Türkic is the now forgotten English passival reflexive form, where progressive aspect of a verb (Eng. -ing, Tr. -inč) was used in active voice with passive semantics (“breakfast eating” ~ being eaten), equivalent to the reflexive forms apparently ascending to the European Sprachbund vernaculars that existed in the Central Europe before the crucial events of the 3rd mill. BC. Relicts of that form are preserved in Eng., Dan., Sp., It., Lith., Serb., Pol.: pre- or postposition self/sig/se/si/save/se/sie (ascending to the Tr. öz “self”) respectively, and suffix sya in Russ., all meaning “self" and forming reflexive semantics were the objet is the subject. This now lost English form was creolized very late, in the 19th c.; in the Shakespearean time it was still active, and Johnny could be bathing (i.e. bathing himself), house could be building (i.e. building itself ~ to be under construction), breakfast could be eating (i.e. eat itself ~ be consumed), and lovers could be kissing ( (i.e. kissing themselves ~ kissing each other). In Türkic, the passival form is formed with the suffixes -m/-ïm/-im/-um/-üm and nearly universal stem öz “self”, all denoting “me, mine, self”. The öz, grammatically pre- and postposition, is a forerunner of the English possessive -'s, and European variations of pre- and postpositions “self”.

Till the Middle Ages and the input of foreign writers into the Türkic literary languages mostly via religious translations, Türkic languages did not know relative pronouns. In Türkic, suffixation serve the relative function. That inheritance is still prominent in modern English; while the presence of relative pronouns is characteristic, in particular, of the written Standard English, the standard variety has influenced the dialectal English, but had not taken over. The verbal use of relative pronouns is still subdued, with the absence (zero case) of relative pronouns reaching 30%, and a single particle that taking another 40%; the combined 6 relative pronouns share less than 40%, and that in spite of the absence of the suffixation tool available in the Türkic languages.

Another distinct feature shared by Germanic and Türkic languages is rhotacism. Rhotacism is one of the main traits that separate Oguz (-s-) and Ogur (-r-) languages, to a degree that these groups are defined as an s-language versus r-language. This preeminent trait is continued to the Germanic languages, notable, among others, in the Dutch (Y-DNA R1b 70%, R1a 4%) -s plurals and (by rhotacism) Scandinavian (Swedes, Y-DNA R1b 17%, R1a 21%) -r plurals. The ratios of the Y-DNA components point that Y-DNA R1b marker is correlated with the -s- form, and Y-DNA R1a marker is correlated with the -r- form. The same separation appears among the Türkic ethnicities historically connected with the Oguz (Common Türkic) and Ogur (Karluk subgroup, Uigurs, Uzbeks) groups, and is supported by paleogenetic testing.

Both English and Türkic have a preference for closed syllables ending with a consonant morpheme. That proclivity may be reversed passing through a language with an opposite tendency for open syllables ending with a vowel morpheme. The most frequent adaptation of a closed syllable by the alien linguistic inertia is by adding a prosthetic consonant in front of the initial vowel (VC > CVC). The initial vowel may be dropped (Cf. Türkic apat vs. IE pat “leg”, Cf. podiatry, peddler, pedal). Because such changes are caused by structural and not phonetic adaptations, such adaptations are unexplainable phonetically, as also are unexplainable the cases of metathesis (e.g. ab > ba etc.). The mere observations are routinely stated as exhaustive explanations: “h is often wrongly prefixed to words, as in Cockney English” (Hall J.R.C., 1916, p. 142). In Slavic, and shared by some Germanic languages, a prosthetic s- is systemically appended in front of the root, or is appended a prefix s- marking the perfect tense, the two can be easily confused. The systemic nature of prosthetic consonants is suggested by the mass of cases in the present study, these cases as a paradigmatic body extend a conspicuous diagnostic and tracing opportunity for the direction of borrowing and the nature of the guest and host languages. Systemic inattention to the structural and morphological properties of the languages in favor of the lexical content is perilous for etymological screening.

At each stage on their historical path, communities encountered alien communities and formed their unique lingua franca. Linguistic changes reflected initial idiosyncrasies for each unique situation. Each situation predicated its unique results at the terminal point, which marked a belonging to the previous stage, and a beginning of the next stage. It is impossible to account for all the past stages, for all the past players, for the murky linguistic relations during each stage. With so many largely unknown variables, any generalizations have their uncertainty and defined limits, both of which are critical in significance but can’t be defined for the prehistorical and most of the historical periods. None of the phonetic changes are predictable either backward nor forward. Consequently, the most that can be accurately stated etymologically is the vague “usual phonetic changes” and “few vowel variations”. Without a statistical assessment of the entire body of evidence for each variable parameter encapsulated between two terminal points, any assertion is no more than a hypothesis lacking a factual foundation.

A correlation between centum (k) and satem (s) languages in Europe is similar to the situation with the r/s alternation. In most -k- lands (Celtic, Italic, and Germany), R1a is the significant population component. R1a comes close to a minority in most -s- lands, where it may be compatible with the R1b content (Slavic). Comparing France (cent, -s-, R1b 50%, R1a 10%, I 30%) and Germany (hundert, -k-, R1b 50%, R1a 25%, I 25%), the main difference is with the R1a extent, and it correlates with the switch, from R1b -s- and small R1a, to -k- with greater R1a. It appears that relatively greater R1a tends to shift -s- to -k-. That observation is consistent with the earlier dating of the R1a and the earlier origin of the velar fricative in parallel with k- and q-. The corollary of that process may be a conclusion that the vicinities of the Aral-Caspian area, extending to the north to the Urals and in the south to the Mesopotamia, where the k/s furcation is most pronounced, initially were predominantly peopled by the R1a -k- type populations. The modern predominance of the -s- type in that area was caused by later migrations.

The European s/k divide is shared by the Türkic family, with a mirrored symmetry predicated by the demographic events. Within the Türkic milieu, the divide is more like s/h in today’s Romanized form. The -s- form is now predominant across Eurasia, the -h- form is scattered around the fringes, largely coexisting with and being supplanted by the dominant -s- form, except for the Alats/Khalajes who live in the -h- area. The modern spread of literacy and the influence of the titular languages tend to isolate and dilute the relict -h- islands. Still, the divide is very tangible (the Swadesh 100 list’s examples are from Dybo 2013, total 11 examples): “all, entire” Tat. saw, Bashk. haw; “belly” CT ičiksi, Sary-Yig. hijigï; “cold” Tat. sumk, Bashk. himk; “die” Tat. ül-, Khalaj hil-/hel-; “egg” Chuv. smarda, Dolgan himït; “far” Khalaj hiraq, Yig. iraq; “fat” Tat. simez, Bashk. himeδ; “kill” Tat. üter-, Khalaj hilär-; “thou” Tat. sin, Bashk. hin; “warm” Kumyk isi, Khalaj hissi; “water” Tat. su, Uzb. suv, Bashk. hïw. The -h- examples are limited to the Bashk. (5), Khalaj (4), Sary-Yig. (1), and Dolgan (1) languages, a tiny surviving relict of once dominant linguistic subdivision now extending to the European languages. The divide affords diagnostic value for the European linguistic pedigree.

The spotty bits and pieces hint that R1a are Scythians, Huns, Ases/Alans/Massagets, Kangars, and their lesser kins Bulgars, Suvars, As-Tokhar Yuezhi, called Ogur tribes in the modern linguistic nomenclature. The spotty linguistic traces point in  the same direction, corroborating genetic indicators. Indications for the R1b marker are still blurrier yet, hinting toward the northern Tele tribes and their descendants Oguzes, Kipchaks, Uigurs, and their lesser kins Tele, Teleuts, Telengits, called Oguz tribes in the modern nomenclature. Except for few isolated cases, all populations were genetically mixed, covering diverse constituents under umbrellas of a leading tribe or a common unifying designation, like the Huns “kins”, Saka “mountaineers”, Alans, Ases “flatlanders”, etc.

An interesting case is the suffix -an, which survived in Türkic languages as an abstract noun suffix forming derivative nouns from concrete nouns. In English, -an survived as abstract suffix forming denoun and deverbal adjectives (terranean, distant), and a spectrum of deverbal nouns (appearance, servant, lubricant). Apparently, its initial function was a universal generic abstract applicable to all word classes, and its functional stratification developed at a geographical-dialectal level; it fossilized as an abstract suffix forming verbal infinitives within the Gmc. group (Cf. A.-Sax., Gothic, etc.), while the Balto-Sl. and Indo-Aryan languages adopted for verbal infinitives the Türkic 3 p. sing. suffix marker -t/-ti/-d/-di. Depending on the scenario of the stratification, the ingrained division may be timed by the Balto-Sl. and Indo-Aryan separation, by the 2nd mill BC that division already had a systemic nature that has survived to the present. At first, English had inherited the suffix -an as a marker of verbal infinitives, but eventually abandoned it in favor of the preposition marker to, apparently genetically a reflex of the Balto-Sl. suffix marker -t/-ti/-d/-di. This adaptation is consistent with the documented process of the development of English, and points to the demographic pressure of the Balto-Sl.-type vernaculars in the development of English. The end product of adaptation was a result of a play of competing tendencies between the sedentary majority and waves of intruding minorities, the Normans, Anglo-Saxons, Romans, and so on into pre-historic times.
18, 19, 20, 21, 22, 23

The following Table 3 provides a cross-reference between the Türkic and English suffixes; it demonstrates a morphological continuity between Türkic, Latin, and English suffixation, an indelible example of an extremely massive paradigmatic transfer. That transfer is not only consistent, it is predicated by the archeologically and genetically established nomadic waves flooding the European peninsula starting in the middle of 4th mill. BC. The table lists nearly all active and productive English suffixes, allowing to discern the relative extent of the heritage and innovations. The fields that do not have Türkic entries are loan forms and/or innovations. Quite a significant proportion of the English suffixes is shared between Türkic, Germanic, and English languages. Of the 76 English suffixes, 48 or 63% were inherited from the Türkic mother lode, either directly, or via a Lat./Fr. intermediary.

English has abandoned the complexity of agglutination recorded in the Anglo-Saxon speech. Mastering agglutination allows to express relationships and nuances that without agglutination require a descriptive form, but that is a long process, impossible when the lingua franca is created on the run. Among the lost Anglo-Saxon suffixes are -u and -an, -isc, -e, -re, -an (gen.), -a, -ra, -na, -as (pl.), -an, -ang (negation), -m (poss.); adding these 13 vanished suffixes to the list would make the respective numbers in the Old English 89 and 61, and rise the Türkic suffix component in the Old English to 69%. The comparison illustrates not only the morphological losses, but also the power of heritage and its conservatism.

Table 3a. Türkic–English suffixes
Suffix English Sample English Usage English Etymology Türkic
-a loanwords:
perceived as a part of the stem   -a, forms nouns of verbal stems for result of action named by the stem: yar - yara (cleave - wound)
-able enjoyable; lovable; suitable forms adjectives from verbs with sense of “capable or susceptible of being" Lat. -abilis, -ibilis > Eng. -able, -ible, conflated with “able” (adj.) -bilä (“ability”) after -a/i forms adjectives expressing
1. likeness, reciprocity, proximity
2. instrumental(ity)
3. temporalityy
Ultimately from stem bil- “know”
-al (1) national; historical forms adjectives with sense “of the kind of, pertaining to, having the form or character of” Fr. -alis, Lat. -alis -al, ultimately fr.-alqu “all”: ulus - ulusal (nation - national); -al/-il “with”
-al (2) refusal; denial; arrival forms nouns of action from verbs ME -aille, Fr. -aille, Lat. -alia Ditto
-an terranean, american forms adjectives from nouns Fr. -ain, -en, Lat. -anus -an instr. case
-ance appearance; clearance forms nouns from verbs with sense “characterized by or serving in the capacity of” Fr. -ance, -ence, Lat. -antia, -entia Dittoo
-ant (1) contestant; servant forms nouns with a sense of being someone OFr., Fr. -ant, Lat. -antem Ditto
-ant (2) lubricant; deodorant forms nouns with a sense of being something Ditto Ditto
-ant (3) distant; dormant; pleasant with a sense of doing or being something Ditto Ditto
-ar (1) burglar; scholar forms various nouns including occupations Lat. -arem, -aris -ar/-er “man”
-ar (2) circular; singular forms adjectives with sense “of the kind of, pertaining to, having the form or character of" Ditto -ar active voice
-ate (1) consulate (n); elaborate (adj) forms nouns & adjectives with various meanings OFr., MFr. -at, Lat. -atus, -atum -t abstract noun
-ate (2) populate (v.) forms verbs with various meanings Ditto -t verb voice
-ce once; twice; thrice forms numeric terms indicating a multiplying effect A.-Sax. -ce, adverbial genitive suffix. -ča/-čä (-cha/-chə) adverbial genitive suffix
-cy delicacy; piracy forms abstract nouns from adjectives Lat. -cia, -tia, Gk. -kia, -tia, from stem ending -c- or -t- + -ia abstract ending -č (-ch) abstract noun suffix
-y/i - v. > n.
-cy Nancy, fancy forms diminutive-endearment nouns, adj. Ditto -kïya/-gïyä/-qïia/-qïna distinguishing-diminutive suffix-particle
-ed (1) counted; worked forms past tense and past participle of verbs A.-Sax. -ed, -ad, -od > ME -ed, ONorse -tha, Goth. -da, -ths, OHG -ta, Gmn. -t; Lat. -tus, Gk. -tos, Skt. -tah -da/-δa/-ta(čï), dä/-δä/tä(či) (-də/-δə/-tə) participle suffix
-ed (2) winged; bearded forms adjectives from nouns indicating attributes Ditto -ad/-äd noun > adjectival participle
-da/-δa/-ta(kï), dä/-δä/tä(ki) (-də/-δə/-tə) adjective suffix
-en (1) wooden forms adjectives from nouns indicating attributes A.-Sax. -nian, ONorse -na, Lat. -ine, Sl.-an, Latv. -na, ne -an/-än (-ən) noun instrumental suffix
-en (2) broken; rotten, written forms adjectives from verbs indicating attributes A.-Sax. -nian, ONorse -na, Sl.-an, Latv. -na, ne -an/-än (-ən) verbal adjectival suffix (passive voice)
-en (3) children; oxen forms plurals for some nouns A.-Sax. -nian, ONorse -na -an/-än (-ən) obs. pl.
-ence abstinence; difference a noun suffix equivalent to “-ance", corresponding to the suffix "-ent" in adjectives Fr. -ance, -ence, Lat. -antia, -entia -an/-än (-ən) instr. case
-ent (1) different; absorbent forms adjectives with a sense of doing or being something Fr. -ent, Lat. -entem -an/-än (-ən) obs. verbal adjectival suffix
-ent (2) deterrent; adherent forms nouns with a sense of being something Ditto -an/-än (-ən) noun instrumental suffix
-er (1) teacher; fisher forms adverbs & adjectives of comparison Gmn. -er, Herr “man”, A.-Sax. -ere, ONorthumbr. -are “man who has to do with”, Sw. -are, Dan. -ere -ar/-er “man”
-er (2) older; faster, better, elder forms adverbs & adjectives of comparison A.-Sax. -ra (masc.), -re (fem., neuter), Goth. -iza, OSax., OHG -iro, ONorse -ri, -iro, Gmn. -er -raq/-räk high (absolute) or higher (relative) degree of quality in adj. and adv.
-er (3) soccer, primer   English innovation recycling -er (1), 1860s  
-ery fishery; perfumery; shrubbery forms abstract nouns from other nouns ME -erie, Lat. -arius Türkic yer, yeri (Eng. earth) “place, location” ~ “fish place”
-y/i - v. > n.
-ess stewardess; actress; waitress forms feminine nouns A.-Sax. -icge, Fr. -esse, LLat. -issa, Gk. -issa  
-est oldest; hottest; sexiest forms superlatives Goth. -sts, Du. -st  
-ful (1) doubtful; peaceful; beautiful forms adjectives with a sense of “characterized by" A.-Sax. -full, -ful,“full” (adj.).  
-ful (2) cupful; spoonful forms nouns with a sense of “fullness" Ditto  
-fy beautify; simplify forms verbs with a sense of “to make, to become, cause to be" Fr. -fier, Lat. -ficare “make”  
-hood (1) neighborhood; brotherhood; falsehood forms nouns of things with sense of “character, nature, condition, etc." A.-Sax. -had, Gmn. -heit, Du. -heid, from hade “condition, position, manner, quality” -qut/-ɣut/-gut/-qüt/-ɣït/-güt plural, alp “shooter” ~ alpaɣut “retinue”, bai “rich person, sing.” ~ baiaɣut “rich (people, pl.)”
-hood (2) priesthood; womanhood forms nouns of persons of a class or character Ditto Ditto
-hood (3) childhood; adulthood; boyhood forms nouns indicating a time period in life Ditto Ditto
-ible credible; horrible; contemptible forms adjectives (equivalent to “-able" suffix) Lat. -abilis, -ibilis > Eng. -able, -ible, conflated with “able” (adj.) -bilä after -a/i adjectives expressing
1. likeness, reciprocity, proximity
2. instrumental(ity)
3. temporality
-ic poetic; scientific; artistic forms adjectives with sense of “aptitude, characteristic of, in the style of” Fr. -ique, Lat. -icus, Gk. -ikos -g/-ɣ/-ag/-aɣ/-ïg/-ïɣ/-ig/-iɣ/-ug/
-uɣ/-üg/-oɣ/-ög forms nouns, adj.
-ical electrical; historical forms adj similar to “-ic" suffix, with sense of “having ability or characteristic of ” or “in the style of” Ditto + -al  
-ile docile; volatile forms adjectives with sense of capability or characteristic Fr. -il, Lat. -ilis -ile “with”: “with docility”, “with volatility”
-ing (1) smiling; crying forms present participle verbs that may be used as adjectives A.-Sax. -ende, Gmn. -end, Goth. -and, Lat. -ans, Gk. -on, Skt. -ant -an instr. case
-ing (2) building; sewing forms nouns from verbs expressing the action of the verb or its result, product, etc. A.-Sax. -ing, -ung, ONorse -ing, Du. -ing, Gmn. -ung -ïn (-in), -ïŋ (-iŋ) deverbal noun/adj. result
-ion contrition; suspicion; creation forms nouns denoting condition, process, action, etc. Fr. -ion, Lat. –ionem (part -em does not have etymology, corresponds to Türkic –em/-im) -ön/-öng “space, in front of” > -ion
-ish yellowish; childish, British forms adjectives with sense of “somewhat, rather so, characteristic of" A.-Sax. -isc, ONorse -iskr, Gmn. -isch, Goth. -isks, Gk. -iskos (dimin.) -g/-ɣ/-ag/-aɣ/-ïg/-ïɣ/-ig/-iɣ/-ug/
-uɣ/-üg/-oɣ/-ög forms nouns, adj; -čà/-čä (-cha/-che)
-ism consumerism; alcoholism forms nouns denoting action or practice, state or condition Fr. -isme, Lat. -isma, -ismus, Greek -isma  
-ist dentist; conformist; conservationist forms nouns that denote a person that is concerned with something or holds certain principles Fr. -iste, Lat. -ista, Gr -istes  
-ity capability; diversity; disability forms abstract nouns expressing ability, state or condition OFr. -ite, Lat. -itatem  
-ive active; corrective; restive forms adjectives & nouns expressing tendency, disposition, function, condition, etc. OFr. -if, Lat. -ivus  
-ize customize; fantasize forms verbs with a sense to make, convert into, subject to; give a special character or form Fr. -iser, Lat. -izare, Gr -izein  
-let booklet; droplet; eyelet forms nouns with a sense of smallness or triviality ?  
-ling duckling; hatchling; underling forms nouns with a sense of smallness or being diminutive A.-Sax. -ol, -ul, -el; + -ing  
-ly (1) casually; carefully; gladly; hourly forms adverbs with sense of “how done or when done" A.-Sax. -lic, OFris. -lik, ONorse -ligr, Du. -lijk, OHG -lih, Gmn. -lich -lig/-lan “like”
-ly (2) weekly; fully; locally forms adverbs with sense of similarity A.-Sax. -lice, OFris. -like, ONorse -liga, OSax. -liko, Goth. -leiko, Du. -lijk, OHG -licho, Gmn. -lich, cognate with “like” (adj.) -lig/-lan “like”
-man airman, yeoman forms instrumental nouns Eng. -man, Gmn. -mann -man emphatic “very, main, most”
-ment agreement; judgment; ailment forms nouns denoting an action, condition, product, result, etc. Fr. -ment, Lat. -mentum  
-ness kindness; correctness forms abstract nouns denoting quality, state or condition A.-Sax. -nes(s), OSax. -nissi, Goth. -inassus, MDu. -nisse, Du -nis, OHG -nissa, Gmn. -nis  
-or (1) actor; creditor; juror forms nouns denoting a person who does something or who has some particular function OFr. -our, Fr. -eur, Lat. -orem, -atorem -ar/-er “man”
-or (2) error; pallor; squalor forms nouns denoting action, state or condition, quality or property OFr. -our, Fr. -eur, Lat. -orem, -atorem  
-ous dangerous; glorious forms adjectives with a sense of having a certain quality Fr. -ous, -eux, Lat. -osus  
-ry bravery; jewelry forms abstract nouns from other nouns & adjectives ME -erie, Lat. -arius.  
-ship friendship; censorship forms nouns denoting condition, character, office, skill, etc. A.-Sax. -sciepe, Ang. -scip “state, condition of being", OFris. -skip, ONorse -skapr, Dan. -skab, Du. -schap, Gmn. -schaft, “cognate with “shape”” (speculative etymology) -sig/-siğ/-siɣ denominal and deverbal instr. case “resembling (something)”
-sion decision; invasion nouns denoting condition, process, action, etc. Lat. -s + -io -ta, -te (locatve “in” ) + ön/öng “space, in front of” > -taön, -taön, -taöng, -taöng ~ “in space, in place” > -sion
-sy clumsy, folksy forms denoun adjectives   -si/-sï denoun adj.: yarsı:- “revolted, disgusted”, simulative verbs (“rare”)
-t part, port, unit (atavistic) forms deverbal nouns Lat.  forms deverbal noun (par - part) -t deverbal noun
crept, slept, burnt forms past participle of weak verbs A.-Sax. -ed, -ad, -od > ME -ed, ONorse -tha, Goth. -da, -ths, OHG -ta, Gmn. -t; Lat. -tus, Gk. -tos, Skt. -tah; d/t alteration -da/-δa/-ta(čï), dä/-δä/tä(či) (-də/-δə/-tə) participle suffix
Lat. -t/-te -ta, -te locatve “in”
-tain chieftain, captain forms Lat. ten- “have, hold" > tain, ten, tent, tin  
mister, father obsolete notion of respect  

-ter/-der pl., suffix of respectful appellation

-th (1) birth; death forms nouns of action A.-Sax. -ðu, -ð, ~ Skt. -tati-, Gk. -tet-, Lat. -tati- -ta, -te locatve “in”
-th (2) length; depth; width forms abstract nouns denoting quality or condition A.-Sax. -ðu, -ð, ~ Skt. -tati-, Gk. -tet-, Lat. -tati- Ditto
-th (3) fourth; sixth forms ordinal numbers A.-Sax. -ða, ~ Skt. -thah, Gk. -tos, Lat. -tus Ditto
-tion alteration; location forms abstract nouns Lat. -t/-te + -io -ta, -te (locatve “in” ) + ön/öng “space, in front of” > -taön, -taön, -taöng, -taöng ~ “in space, in place”
-ty (1) loyalty; purity forms adjectives denoting quality, state, condition, etc. ME -tie, -te, OFr. -te, Lat. -tatem ~ Gk. -tes, Skt. -tati- -te, -ta (locatve) > Gk. -tes
-ty (2) twenty; sixty forms numerals denoting multiples of ten Goth. tigjus, ONorse tigir “tens, decades”; A.-Sax. -tig, Du. -tig, OFris. -tich, ONorse -tigr, OHG -zug, Gmn. -zig  
-ure departure; failure forms abstract nouns denoting action, result, agent, instrument or apparatus OFr. -ure, Lat. -ura -r/ur/ür/ir/ïr verbal analytical intrans. base > depart + ur > departure
-y cloudy; dreamy; juicy forms adjectives with sense of “characterized by, inclination, condition" A.-Sax. -ig, Gmn. -ig) ~ Lat. -icus, Gk. -ikos -ig/-iɣ/-ik/-ïg/-ïɣ/-ïk - verbal adjectives
-y/i - v. > n.
-an     A.-Sax. -as -an (pl.)
-s books forming plural nouns A.-Sax. -as, Du. -s plurals, Scand. -r plurals (rhotacism) -s (obs.)
-es ashes forms plural nouns
-ies armies forms plural nouns
-ves calves forms plural nouns
3rd Person Singular Verbs
-s makes; creates forms 3rd person singular verbs A.-Sax. -es, -as, Northumbr. -eð (-eth, voiced) -sa/-sä (sə) predicate of subordinate clause
-es touches; finishes forms 3rd person singular verbs Ditto Ditto
-ies defies; cries forms 3rd person singular verbs Ditto Ditto

English retained from the Türkic ancestry the relict paradigmatic suffixes that lost their active function: stay > stanch, staunch with -an (deverbal continuous action) + -ch (deverbal noun/adj.) “stop (water, process)”, “stay, remain (loyal)” respectively.

English also recycled the Türkic prepositions (phonetical and functional prefixes), postpositions, and suffixes into prefixes, retaining their original function either literally or notionally in a new syntactic typology.

Table 3b. Türkic–English prefixes
Prefix English Sample English Usage English Etymology Türkic
de- debase, defibrillation, dismount, destiny, deserve Locative, from A.-Sax. te-, OSax. ti-, OHG ze-, Gmn. zer-, Lat. de- locative “down, down from, from, off; concerning” -da-/-de-/-ta-/-te- locative “down, down from, from, off; concerning”
dis- dismount, disengage, dishonest, disallow forms nouns, adjectives from verbs with sense of “capable or susceptible of being" A.-Sax. te-, OSax. ti-, OHG ze-, Gmn. zer-, Lat. dis- locative “apart, in a different direction, between, not, un-” Ditto, crasis of the locative -da:/-de:

The morphological and syntactical trail left by the Türkic languages in English and other Germanic languages is so deep and wide that it is embarrassing to think of the reasons why such obvious spoor was left unexplored for so long by the European linguistic community.


The scholarship of the Türkic languages' phonetics is fairly well established. To a limit. Beyond the limit lie unexplored seas and untouched unknown gulfs. Phonetic interdentality is both unknown and ignored. That provides misleadingly simplified scholarly transcriptions yer or yers in lieu of yerth for the “earth”. The English and other Germanic languages suffer from that unexplored gap, especially so because the probable origin of many Germanic substrate Turkisms is connected with the modern Turkmen language and location called Turkistan in the 19th c., and now called Turkmenistan, Uzbekistan, China, and Iran. Labiodentality, bilabiadentality, and labialization, palatalization, velar plosiveness, post-palatal velar plosiveness, and voicing of velar plosives, and some more traits are out of view. Beyond the limit lie the Türkic phonetics endowed with “no single phonemic system for modern Türkic languages” (Sevortyan 1974 vol. 1 p. 51). Transcriptional violations are endemic in Turkology. A jumble of individual ad hoc alphabets was created by individual schools to fit their own phonetic interpretations palatable to their peculiar tongs and ears. The past efforts to Romanize, Sinicize, Arabicize, Cirilicize, Russify Türkic vernaculars adds to the distortion maze. A comprehensive phonemic system would uncover presently obscured phonetic features. Such a system would definitely affect masses of accumulated renditions, uncover substance presently obscured by the poorly fit machinery, and bring new study material to the field.

Three traits dominate Türkic phonetics: division into front (palatal) and back (velar) phonemes, practice of using syllables composed of either front or back phonemes, and practice of composing words of either front or back syllables. In the words, neutral phonemes may complement either front or back phonemes. The result of this simple arrangement, called vowel harmony, is efficient verbal communication with minimized articulation, and efficient written depiction with reduced to no need to indicate trailing vowels. The initial consonant or vowel sets a tenor for the whole word.

The written world that we have inherited approximates the wealth of the spoken language very economically. A written word reduces a spoken world to a some palette with a kit of phonemes that may range from a couple of dozens to few dozens. They usually allow to grasp a meaning of a word, but render specifics of a live speech as a rough sketch. A good example is given by the phoneme n, that in Türkic may be a dental n, palatal n, or a guttural n (ŋ), and can be strengthened to ŋg, ŋğ, ŋk, with further variations. Some phonetic foibles can be rendered by some alphabets and not the others. An element of phonetic ambiguity is always present, especially when rendered by alphabets that formalized a significantly different phonetic system. Any perceived phonetics of the recorded spelling implies a degree of latitude. For the vanished languages, the span of the latitude always rests in a speculative sphere. That makes comparisons of words coming from different times and different languages a sorely imprecise science.

Some peculiarities are connected with the conventions of transcription, with the Romanization of the alphabet, and similar-type peculiarities introduced by the Arabic, Cyrillic, Chinese, and other non-native renditions. Romanization, for example, replaced the interdental unvoiced and voiced fricatives with Roman plosives d and t, Cf. Türkic yer q.v., Gmn. erde, A.-Sax. eorð, Goth. airþa “earth”. The A.-Sax. and Goth. indicate the final consonant fricative, retained in Eng., while the conventional renditions of the Türkic yer do not. That may be accurate for simplified modern pronunciations, but does not reflect the relict and dialectal versions, Cf. simplified spellings for “below”: kodi: (EDT) and qodï, qoδï, qojï, qozï (OTD). Simplification and debasement seriously affect interpretations concerning the r/s/l alternation paradigm, where the arguments on the primacy find tangible support for every scenario. They end up as premises and lead to circular logic conclusions. A fairly clear example of the r/s/l assimilation is set by the sibilant vibrant rz/rs in Polish and Czech that corresponds to either r or s reflexes in contact languages with their own particular phonetic preferences. There, the r may be articulated as r or l, called rhotacism, and s may be articulated z, š, č, and ž, Cf. riz-/ris- in the Türkic rizan/resim and the OHG rizan, and Türkic čiz-, which originated as a stem for “chisel, carve” that later grew to mean “writing”. The r/s/l alternation is shared by numerous languages across linguistic families and across the length of the Eurasia. Numerous independent archeological, biological, and linguistic indicators allow to assess the timing for at least some of the linguistic contacts, providing the r/s/l transitions with timeframe boundaries. The Sl. rz/rs and the r/s split are not two independent puzzles, they are two separate segments of a single development that was probably connected with the pan-European Y-DNA haplogroup I community of the 5th mill. BC. That community initially accommodated the first Kurgan waves, and was reduced and scattered by the following Kurgan waves. In the hideout areas, it amalgamated with diverse locals, stratifying their phonetics into numerous prongs distinguished by peculiar r’s and s’es from the roaring r’s to the mute r’s, and from the whispering and whistling s’es to the thundering shch’s.

All attempts to classify Türkic languages note the Ogur/Oguz divide, with Ogur languages distinct by initial semi-consonant or consonant where Oguz languages start with vowel. The primacy of the Ogur or Oguz version has long been debated, and settled by G. Clauson (1972) with observation that some derivatives have forms with and without anlaut prosthetic phonemes while the stem shared by all languages goes without it. The conclusion is that Oguz languages were older, with a corollary that Oguz-type Tele languages were parental to the Ogur languages. On the other hand, scientific consensus sides with the conclusion of O. Pritsak that Ogur languages predominated in the European part of the Eurasia prior to the 10th c. AD. The tumultuous events of the first 1500 years of our era broke the clear division between Oguz and Ogur languages in the western Eurasia, mixing and amalgamating peoples and languages. That created monumental problems for the linguists, who largely did not see the Ogur component in the Oguz ocean, and ended up classifying the Ogur component as Oguz languages, on the way leaving the Ogur component out of the focus. The conspicuous Oguric properties of the Germanic languages, such as initial prosthetic consonant or truncated suffixes, were dutifully noted and then left without explanation.

Oguric languages are linked with the Aral basin and its western and eastern extensions, south of the forest-steppe belt with its Oguzic languages. Since the Aral basin area was repopulated by the Kurgan Timber Grave nomads from the west and from the east starting at about 1000 BC, the Aral basin's Sprachbund may have evolved by the middle of the 1st mill. BC. Prosthetic consonants present in the few Scythian words attest to their Oguric-type language (e.g. Scythian jilan vs. Oguzic ilan “Snake”), consistent with a tentative Scythian stop-over in the Aral basin on their way from the Altai to the N. Pontic. In the nomadic world, however, it is perilous to presume a static scenario for mobile dynamic populations with their mixes and matches.

The prosthetic initial consonant was noted as typical for the Germanic languages, and it takes a prominent place in English. Cockney is distinguished for adding prosthetic anlaut h- to the English words that exist nicely without it. That is also a noted feature of a group of the Türkic languages, with the prosthetic hard h- appearing in a range of qualities, Romanized as c-, k-, q, and extending to j, y, ch, and even dialectal sh-. An opposite propensity of the historical English phonetics is to drop the initial h-, like turning hlað- “laugh” into lað- “laugh”. These opposing trends are probably associated with the inherited Oguz-Ogur (adding h-) and the Ogur-Oguz (dropping h-) transitions. Taking the initial prosthetic consonant for a part of the root wrecks a havoc with the IE etymology, readily supplying homophonous roots with unrelated semantics then somehow imaginatively linked with the principal word and its semantics. Ignoring the dropped initial consonant of the root wrecks a similar havoc with the IE etymology. These tendencies are systemic, they are fairly easy diagnosed, and at times they serve to help to find a semantically sound alternate explanation. Frequently, the semantics of the suggested Skt. cognates is so far from the semantics of the principal word that just citing them undercuts, rather than confirms, the IE paradigm.

A major alternation that seems not to be connected with the Oguz/Ogur divide is the m-/b- alteration, illustrated by the English pair be and am of the verb “to be”. The m-/b- alteration has very long documented roots, it was very common in Sumerian, where, for example, bal and me are two forms of the verb “to be”. Much later, it is a common Chinese trait. In old and middle Chinese, labials m- and b- were often interchangeable, particularly in transcribing non-Chinese words and names (Chen S. p. 72). In Türkic daughter and affiliated languages b- may take a range of qualities Romanized as p-, v-, and w-, with further variations, like Pahlavi and Hungarian bh alternating to v-, and Russian b- to v-. According to G. Clauson (1972), b- was a prime Türkic consonant, and in respect to b-, the m- alteration was a later development. However, that judgment was drawn mostly form the eastern and quite late sources, and rested on particular historical presumptions. A phonetic review of the unappreciated linguistic relicts from the Sumerians, Scythians, Sarmatians, and Huns, including the still unexplored early Indian sources, may modify or alter that exploratory conclusion. Under the IE umbrella, in modern and medieval Greek the voiced plosive /b/ was written as digraph μπ, while the late Roman/Byzantine authors regularly used β and not μπ for /b/; the Byzantines articulated β and υ as a /b/, and in Classical Greek β stood for /b/. This most unlikely Greek transition, reportedly in late antiquity, from β to μπ (betacism) parallels the most unlikely process asserted for the Türkic languages as a stipulated observation.

One of the observations in the early days of Turkology was that Türkic languages avoid words with initial liquids in general and initial r- in particular. With time, that observation turned into an accepted dogma. By definition, all Türkic words with initial r- were loanwords. That is mostly true for the fairly recent borrowings. A closer inspection would find out that the words with initial r- are habitually articulated with a prosthetic initial vowel, most frequently with a like of a neutral e-/ə-, but also with a and o-/ö-. A more granular observation helps to discern the initial evolution, which in turn allows to visualize the murky origin  of etymologically uncertain lexemes, like erpe:- “cut, cut off” vs. a modern “robe” with assigned most incredible speculated etymology. For obviously very old lexemes, where a pick between a borrowing and an original is a personal opinion, ability to discern a prosthetic morpheme is an indispensable methodical tool.

The immense geographical spread of the Türkic languages inevitably had to develop a series of Türkic Sprachbunds and lingua francas crossing linguistic barriers with uncounted neighbors spread across Eurasia. A single meridional pasturing route extending for a thousand kilometers could have crossed five alien Sprachbund zones, each requiring a doze of bilingualism to keep things smooth and safe. Compounded with periodic reciprocative longitudinal migrations, a doze of commonality and variation was inevitable. That created numerous phonetic versions of the same root, numerous phonetic versions of the same morphological elements, and numerous synonyms for a great number of words. In various degrees they were passed on to the Türkic-substrate and neighboring alien languages. Attempts to systematize the largely stochastic interrelationships between the Türkic languages were many, different schemes tried to use few selected parameters as systemic criteria, but a crisp and clear systematization still escapes all attempts of linguistic classification. During historical period, Scythians carried their language from the Altai to the Aral basin and on to the Near East and Balkans. They were but one group that fell into the limelight of their literate neighbors; other main groups are known from excavations, and many others are not detectable at all. In the next millennium, such examples infringe on the limits of scientific imagination, with Sarmats, Alans, Huns, Kangars, and many others covering awesome distances. In the last millennium, the map was again completely re-drawn. Each stage recombined local and migrant, creating a linguistic picture resembling impressionist paintings, comprehensible from a distance but totally chaotic at a close examination. Few regular correspondences may be asserted and listed, but a great number is doomed to remain unexplained, left “with no close semantic connection (within Türkic languages)”. At such points, etymology switches from a tracing mode to observation mode.

Various traces of the substrate Türkic phonetics still play a salient role in the daughter languages; the deviance of the Germanic languages within the IE family was noted and studied at the dawn of philology. It is especially prominent for the rounded vowels, being in a constant conflict with the Latin-derived alphabets which necessitated a slew of roundabout methods to depict them in writing. English inherited a bouquet of spelling renderings reaped from the neighboring donor languages, and developed a number of its own ways to cope, ending up with a maze of semi-rules and a snarl of related exceptions which supports the industry of spelling education and spelling bee competitions. The Slavic languages, with phonetics bearing heavy substrate traces of the inherited Türkic phonetics, overcame the conflict by creating a set of alphabet graphemes to depict rounded vowels, sibilants and other oddities. The maze of adaptations tends to mask lexical similarities, misleading uninitiated and providing ammunition for jaundiced protagonists.


Unlike the Norse peoples, England has not preserved its sagas, unlike the Slavic folklore she did not incorporate Türkic history and folklore in her literary inheritance, and the most outstanding relict of its linguistic substrate remains the name cockney, in Türkic spelled köken - “motherland, native place, ancestral land”. Sticking out through the fluff of the later fantasies, the earliest reference to the Cockney is a “mythical luxurious country, first recorded in 1305”, a clear reference to the “ancestral land”, which turned out to be not that luxurious, since its inhabitants ended up in the distant foggy Albion and speaking a creolized mixture of Türkic, Norse, and Romance.

The IE etymology is built on the Ursprache Family Tree model, oblivious to the complex historical processes that were cardinally changing the face of the European peninsula during the last 5 millennia, it is little suitable to describe the dynamic linguistic situation during Neolithic and Metal Ages. Numerous vestiges of the past events either do not find reflection in the IE etymology, or are etymologized with most unsuitable phonetical resemblances to force them into the faulty model. The first category leaves about a third of the Germanic lexis classed as “of unknown origin”, the second category artificially creates misleading evidence that contorts the past. Most IE etymologies are circular, departing and arriving at unknowns, with some phonetical manipulations in-between. In the practice of the Family Tree model, the asterisked conjectured words dissolve like the seeds of the trees, without a trace. In real experience, some words survive for millennia unchanged, passing from language to language like precious stones, with all facets intact. In numerous instances Türkic words survived practically in their original forms, and in some instances still retain their Türkic suffixes, allowing to expose delusory etymology and provide a credible authentic source. In many instances examination of cognates provides no additional leads, and serves purely a perfunctory function to justify the ends.

The Romance borrowings in English are attested historically, and they do not need to be specifically examined to determine direction of borrowing. The Germanic substrate in English is also attested historically, and does not require such examination. The remaining part of the English lexis needs such examination, and the results are not always obvious. A common criteria in such examination is the distribution of the cognates: a word is considered to belong to a linguistic family if most branches of the family have cognates of the examined word. If a word does not appear in the majority of the branches, it is held to be a loanword from another family. In case of far separated branches, like Germanic and Indo-Iranian, a word should be found in both legs, otherwise it is held to be a loanword from another family. The same criteria is applied for borrowings between branches, a word should appear in most languages of the branch to be considered to belong to that branch, and a branch that has it in a minority of its languages is held as a receiving branch. As a rule, distribution is not examined, numerous IE etymologies fail to meet the basic criteria, and the IE classification must be dismissed.

Traditional assumptions on the direction of borrowing at times fail such tests. In most cases, a comprehensive listing of cognates clearly defines the direction of the borrowing, and the density of the borrowings between languages, branches, and families is a good indicator of the cultural penetration or influence. The opposite is true oppositely, a selective choice of cognates forms a delusive indicator and leads to special pleading distortion. Other indicators are the semantic meanings. A generic meaning that turns into specific application (e.g. snake vs. cobra), and polysemantic word that retained only a partial meaning indicate that the specific meaning is a loanword. A nearly mechanical transfer of a polysemantic word complete with its multiple discrete meanings indicates assimilation of a paradigm, a genetic connection driven by demographic events with linguistic consequences.

Latin serves as a main protagonist of English, never failed to be cited. Although some words may have existed in Latin long before it became an imperial language, in a temporal historical development Latin is a late newcomer, contemporary with the spread of the Sarmatian languages into the Western Europe, and long after the Scythian migrations into the Northern Europe. The Latin's reach into the depths of the Eurasia had been minimal temporally and spatially. In contrast, the influence of the Kurgan waves on Latin and other European local languages was massive both temporally and spatially. That is attested by numerous Türkic - Latin cognates in lexis and morphology. With such historical background, the direction of borrowing from Latin into Türkic languages must be deemed impossible, while the direction of borrowing from Türkic languages into Latin must be expected. Many Türkic lexemes reached English by two independent paths, one via demographic path generally synchronous with the spread of Turkisms into other Mediterranean and Northern European languages (the R1b path), and the other via late, started at the turn of the eras, cultural influence of Latin on the Northern European languages. The last path is well documented, it coincided with a rise of literacy and the switch to the Latin-based alphabets.
24, 25, 26, 27, 28

In the process of linguistic amalgamation, because any languages have numerous totally unrelated homophonic lexemes, the receptor languages gain homophones from the donor languages, adding semantic meanings to the existing indigenous words. Traces of such amalgamation are found in nearly all languages, and the Türkic languages, due to the nomadic economy that necessitated amalgamation across Eurasia, are especially endowed with polysemantic vocabulary, frequently passed along during following amalgamation cycles, and that includes the English. Numerous homophonic words in English have diverse origin, much of the natural dialectal variability persevered into the printing age, and dictionaries furnish both the allophones from the pre-printing era, and synonyms originated in various languages. Attempts to etymologize them within the ideology of the Family Tree model are overtly artificial and subject to criticism. Practically all following lexical examples have homonyms and innovations in English, they are of little relevance to the present scope.

Restricting semantic latitude filters out homonyms and incredible etymologies. Restrictions need not to be rigid, they should allow for natural transition from a figurative to fossilized semantics where such transition is obvious, at the same time inhibiting an optimistic overpermissiveness. The span and propensity for forming figurative expressions are not mechanically transferable from a language to language, each language or a group of languages finds its own methods of expression and innovation using its own set of grammatical and morphological tools. In contrast with the flexive-type languages, the specifics of the Türkic agglutinative morphology allows phonetically economical production of a wide range of literal and figurative derivatives of the same root extending to the opposite ends of the semantic spectrum. The morphology-enabled semantic jumps are routinely accomplished with minimal phonetical means, allowing liberal reuse of the same root (Cf. English raise vs. rase). The roots qab “vessel” or dür- “stand” are good examples of the width and limits of the semantic latitude.

Random examples of linguistic layers in English and sister languages deeper than the Middle Age cultural borrowings are compiled in the Table 4, with comments appended. A thorough examination of the English lexicon should locate many more; only semantically distinct verbal, noun, or adjective forms are listed, so an expanded listing with complementary forms and derivatives would be 3-5 times more extensive. The specifically Chuvash cognates are explicated following V.Stetsyuk, 2003. Chuvash is variously classed as Oguric and as independent branch.

The only words included in the OTD are those traced in an actual medieval Turkic text of the eastern provenance, in the earlier period they occur as cognate loan-words in foreign languages in forms indicative of early borrowing. The texts indexed in OTD are a fraction of those which once existed, and undoubtedly other words existed but did not happen to occur in the used surviving texts. Some words occur in medieval texts or in modern Turkic languages. A large number of early words are hapax legomena, they occur only once, and have not been found elsewhere; it is impossible to determine whether hapax legomena are correctly transcribed or interpreted. The perfect tense transcription of the verbs tends to be most accurate, but the OTD does not always explicitly delve into alternate transcriptions or interpretations.

The concept of paradigmatic transfer defines borrowing of some entire complex of features. Paradigmacity literally saturate English Turkisms. Lexical paradigmacity is the most visible trait, at times an entire complement of lexemes is transferred from the substrate language to the daughter language. Those are most powerful examples of the genetic connection, and typically they ascend to the most archaic linguistic layers, like cooking food. Cooking may be done by heating on fire, by smoking, by hot ashes or charcoals, and by boiling. Boiling appear as steaming and burling (churning, purling); thus the lexeme for steaming should be a derivative of steam (bu) or burling (qat-, qatïn-, qatna- “purl, purling”); accordingly, we have words like boil (lit. “steamed” fr. bu- “steam” with passive suffix -l- ), bouillon (lit. “steamed” fr. bu- “steam” with passive suffix -l- and abstract noun passive suffix -on) and kitchen (qatïn- “boil, purl, cook”). Analogous processes are cooking with hot ashes or charcoals, the lexeme for “cooking” should be a derivative of ash (kok-) “(cook food) over fire or smoke” (lit. “ash, ash-burn (it)”, “smoke (it)”). Such complex transfers of the entire paradigm vividly demonstrate multi-faceted genetic connection.

The paradigmatic transfer of the entire cooking lexicon corroborates the genetic findings that the Pit Grave Kurgan migrations were not a series of military raids, but a series of cohesive massive relocations of the tribal societies, with the women being instrumental in relocation of the cooking lexicon. The complex nature of relocation is also corroborated by the paradigmatic transfer of the animal-related lexicon, probably largely carried by the male population, especially so in respect to the wild animals. The replacement of the pre-Corded Ware farming population marked by lactose intolerance by the steppe Kurganians marked by lactose tolerance coincides with the period of the Central European “killing fields”. Since the appearance and dissemination of the biological lactose tolerance in a population is a slow process that requires millenniums to build up, the societies that brought over the trait of the lactose tolerance to the Central Europe must have had millenniums of cohesive development with the diary as a staple food necessary for subsistence of the steppe herders. The steppe lexis replanted to the Corded Ware soil attests to the cohesiveness and endurance of the steppe migrant population and their language. The Central Europe was not a sole destination of the Kurgan waves. Reflexes of the same lexicon recorded in the Sumer tablets point to Mesopotamia as another migratory spur, possibly connected with the southern, Maikop area (ca. 3700 BC—3000 BC) of the Kurgan cultures.

Some of the “unknown origin” words showed up as American English, they are dated to the 18th c. (i.e. the first records) or later. On arrival to America, those Brits and their companions used plenty of Türkic (Cf. boss, chunk, derrick, ok, toilet). Some of the American English lexicon points to the nomadic animal herders (Cf. tire). Before surfacing in the New World, the American Türkic had to be lurking in England for nearly two millenniums.

The Türkic orthography in Table 4, taken from different incompatible sources, is mostly adjusted for phonetical clarity: c = j in jet, č = ch in cheap, y (OTD ï, EDT Turkish ı) = i in sit, ü = u in mule, ä, ə (EDT Turkish e indistinguishable fr. e) = a in apple, ö = o in champignon, š = sh in she, ɣ = voiced guttural g (go), ŋ = ng in ping, δ (EDT Turkish d) = voiced interdental th, Germanic and Horezmian þ, usually rendered as t, d, s, or f depending on Roman or Cyrillic rendering.

Table 4. Türkic–English lexical correspondences  













1 abode oba case (instance) qaza essen (Gmn.) ash     mallet maltu skin saɣrï
2 above up- cash kečä ether äsir     mama mamü sling salïŋu
3 abundant (adj.) abadan (adj.) cast (hurl) (v.) kus- (v.) Europe ev + opa     mammal meme skull kelle
4 abysm abamu cast (form) (v.) qïsdï (v.) evacuate (v.) evük- (v.)     man men smile (v., n.) semeye (v.)
5 access (v., n.) ačsa:- castle kishlak Eve eve eve     many munča (adv.) so (adv.) aša (adv.)
6 ache àčï casualty közün- evict (v.) evük- (v.)     mantra maŋra- (v.) soak (v.) saɣ- (v.)
7 acid (n., adj.) àčï- (v.) cat četük evil (adj., n.) uvul-     marasmus maraz soap sabun
8 acorn yaɣaq category qatïɣ (adj.) evoke (v.) evük- (v.)     mare ma: sock (beating, v.) sok- (v.)
9 act (v.) aqtar- (v.) cattle katıl ewe eve     massif basɣuq sock (stocking) sok- (v.)
10 ad öt cause köze:- exhaust qoxša- (v.)     master bögü: socket sok- (v.)
11 Adam adam cavalry keväl eye ög- (v.)     matt (adj.) mat (adj.) sodden (adj.)) sod
12 again aga (adj.) cave kovı: face yü:z     me (pron.) min (pron.) some kim (morph.)
13 agaze ög- (v.) cavern kovı: faith vara     mead mir son song
14 age aga cavity kovı: false al- (v.)     mean (v.) many (mahny) sooth čïn
15 agile ačïl Celt kel- (v.) far (adv., adj., n.) ıra:-     means min sorrel (adj.) sary (adj.)
16 ago (adj., adv.) aga (adj.) cemetery semäklä- (v.) fare (v., n.) faqr(lïq)     mengir meŋgü squat (v.) čat- (v.)
17 aggrieve aɣrï chaff čob fart burut- (v.)     mental (adj.) meŋtä (adj.) squeeze (v.) qis- [qys-] (v.)
18 aid jarï chagrin qadɣur father ata     menu meŋ stair šatu
19 aim amač chalant (adj.) čalaŋt (adj.) feel (v., n.) bil-     mere mürän stale (v.) si:t- (v.)
20 akin (adj.) yakin (adj.) challenge (v.) čalïš- (v.) feeling bilin-     message muštu stay (v., n.) üstü- (v.)
21 alimentation alım chalk chol find (v.) yind- (v.)     mickle (adj., n.) mig (n., adj., adv.) stick (v.) tik- (v.)
22 alimony alım champ (v.) čap- (v.) fire (v., n.) bur-     might bögü step (v., n.) tep- (v.)
23 all (n., adj.) alqu (n., adj.) chapman čıp + man first bir     milk meme stop top
24 Alban àlban (n., adj.) char (v.) öčür- (v.) fissure öz     mind ming suave šuvlaŋ
25 alms almak Charlemagne Charla-mag flask baklaga     mint (v., n.) manat subliminal (adj.) sumlîm (adj.)
26 ambush (v., n.) buš- (v.) chastise (v.) kast- (v.) folk bölük     mist muz suck (v.) saɣ- (v.)
27 amen (adj.) ämin (adj.) chat (v.) čat- (v.) food apat     mock (v.) -mak sundry (adj.) sandrı:- (v.)
28 amorous amran- chattel čatïl foot but     model -mak sure (adj.) sürek (adj.)
29 -an (pl.) -an (morph.) chatter (v., n.) čatu:r (v.) frog baga     Mohn (Gmn.) “poppy” mäkän surrender (v.) süründi- (v.)
30 analogue anlayu (adv.) cheap (adj.) čıp (adj.) full (adj.) yӧlä-     moisture mayi susurrate (v.) šar šar (v., n., adj.)
31 and (conj.) anta: check chek gabble (v.) gap- (v.)     monastery manastar suture sač
32 anvil andal cheek čaak gadding qad     money manat swear (v.) vara- (n.)
33 anger (v.) özak (adj.) cherub čebär gaffe ɣafillïq     moose mus sweep (v.) süpür- (v.)
34 anguish özak (adj.) chew (v.) kev- gaggle (v.) qaɣ quɣ- (v.)     mother mamü swell siwel
35 antler anten child koldaš gain gänč     mount (v.) mün- (v.) swill ašbar
36 any (adj., adv., pron.) ne: chill (v., n.) čil gamut (adv.) qamit (adv.)     mountain mün- (v.) tab tap- (v.)
37 apian arï chintz čit garden karta     mouse muš tablet tü:b
38 apt apt chip čïp gaze (v.) giz- (v.)     much munča (adv.) taco toqüč
39 arch arca chirp (v., n.) čïlra (v., n.) gene gen- (v.)     munch (v.) meŋ tack (v., n.) tak- (v.)
40 archaic (adj.) arca chisel (v.) čiz- (v.) gentle (adj.) yinč- (adj.)     muscle muš tad tat
41 ard or chitchat (v., n.) čit čat (v.) genu yinčür- (v.)     murky (adj.) mürki (adj.) tag toqu
42 ardent arzu (n.) chop (v., n.) čop- (v.) get (v.) qay- (v.)     my -m take (v., n.) teg-
43 are (v.) -ar (v., n.) chunk sïŋuq gibber (v.) gep- (v.)     nag (v., n.) öyäz tale tili- (v., n.)
44 argue (v.) arqu- (v.) chute čüm- (v.) gift kiv- (v.)     nascence ña:š talk (v., n.) tili- (v., n.)
45 arrogant (adj.) orı: circle sürkülä (v.) gird (v.) qur- (v.)     needle ine tall (adj.) tal
46 Arthur artur- (v.) clan oglan/ulan girl kyr     nose ñü:z tally (v., n.) tili- (v., n.)
47 as (adv.) aδïn (adv.) clinch (v.) qïlinč (v.) gist göz     not (interj.) ne (part.) tambourine tambur
48 As Yazï cloud bulut give kiv- (v.)     oat ot tap (v., n.) tap- (n.)
49 ashlar aslïq- coach (v.) köch (v.) glue (v., n.) yelïm     oath ötä- (v.) tap (v.) tap- (v.)
50 asp äväs coagulate (v.) qoyul- (v.) gluten yelïm     obturate (v.) tiy- (v.) tar (v.) ter- (v.)
51 ass eš(äk) coal kül/köl glut oglït- (v.)     ocean ӧkän tariff tarïɣ
52 asquint qïŋïr (n., adj.) coat gömlek gnat öyäz     ofett (OE) apat tart (adj.) tarqa (n.)
53 assess asiɣ cob kev- gnaw öyäz     ogle (v.) ög- (v.) tasse (Gmn.) tas/taz
54 assign (v.) asïɣ cock (latch) kök go (v.) git     ok (interj.) ok (interj.) taste (v., n.) tat- (v.)
55 astute (adj.) asurtɣuq (adj.) cock (rooster) kök goat käči     old (adj.) ol- (adj.) tasty tati (adj.)
56 at (prep.) at- (v.) cockney köken God kut     omen aman (adj.) tavern tavar
57 attach (v.) atkan- (v.) coffin kovı: gold al(tun)     omni- (pref.) omqï tell (v.) tili (v., n.)
58 Augean aqür cold xaltarä good kut     on (prep.) on- (v.) tend taya
59 aught ot (adj.) collect (v.) kölar (v.) goose qaz     once ön (adv.) terrain ter- (v.)
60 augur (v.) ay- (v.) colon kolon gore (v.) göres- (v.)     onus önüs (adj.) testament tutsuğ
61 aurora yar- color kula Gorgon qörq-     orate (v.) orı: (n.) testicles tasaq
62 awe (v.) ö- (v.) colossal (adj.) qolusuz grave kör     ore öre: that šu (pron.)
63 awhile (adv.) äwwäl (adv.) comb (v., n.) kem-(v.) gravy kurun     ortho- (adj. prefix) örti- (v.) theriacum tiryak
64 axle i:k come qam- (v.) greave kurun     other ötürü (adj.) thick sik
65 baby bebi con (v.) qun- (v.) grey (adj.) ğır (adj.)     otter ätär thief tef
66 bad (adj.) bäd (adj.) con- (pref.) kon- (n., adj.) grind (v.) qïr-     ought ötä think (v.) saq-
67 bag bag confer (v.) ber- (v.) groom görüm(čï)     over up- this šu (pron.)
68 baize bez coney, cony kuyan guard qur- (v.)     owe (v.) oye- (v.) thread telu- (v., n.)
69 bake (v.) baka:č (n.)     guess (v., n.) us- (v., n.)     owl aba(qulaq) thrive (v.) tir- (v.)
70 bald bül (adj.) cook kok- (v.) guest göster     own (v.) oye- (v.) throne tören
71 bale (v., n.) bele- (v.) copious (adj.) köp (adj.) gut kut     ox öküz tick (v., n.) tiki
72 band (v., n.) ba- (v.) cork kairy habitat oba     ooze (v.) ӧz (v.) tick tik- (v.)
73 bane < pata corset qursa hack (v.) kes- (v.)     pan (n., v.) ban tie (v., n.) taŋ- (v.)
74 barge (v.) bart (adv.) cost kı:z hador (OE) xatär     papa baba/babai till (v.) til- (v.)
75 barge (n.) barq count köni hah qatur (v.)     pat pata (v.) till (adv.) teg (adv.)
76 bark (v.) ver courage kür (adj.) hag karga     peace barısh time timin (adv.)
77 bark barq court qur- (v.) hard (adj.) qart (adj.)     penny peneg tire (v.) tur- (v.)
78 barley ~ Türkic arpa, urba cousin qazïn hare horan     period ö:d tire (n.) teyir
79 barn ambar cove kovı: hash ash     phlegm balgam tit tiši:
80 baron baryn cow coy haze häzl     pole (n., v.) bal- (v.) tit for tat (phr.) tite tit (phr.)
81 bastard bas + tard cowl kalpak heap kip     port bar- (v.) to (prep.) tu- (v.)
82 bash (v.) baš (v.) coy (adj.) köy- (v.) heart chäre     pot patır toe toy
83 bat (v.) pata (v.) crime krmšuhn (v.) hall qalïq     pour (v.) pür toilet tölet
84 bath (v.) bat (v.) crop körpä: Heimat (Gmn.) xajmatläx     prior (adj.) ür toll tol
85 battle pata- (v.) crock kurun Helen ellen- (v.)     puppy papak tomb tumlu
86 bazaar baz crow karga hell qalïq     purge (v.) pür- (v.) too de (adv.)
87 be (v.) buol- (v.) crunch (v.) qurt (v.) herb arpa:     purl (v.) bu:r- (v.) tool tolɣa- (v.)
88 bear (v.) ber- (v.) crust kairy herd kert     purse bursaŋ tooth tiš
89 bear böri cry qïqïr- (v.) hernia urra     push (v.) puš- (v.) top töpü
90 been (v.) buol- (v.) cuddle (v.) koy- (v.) hey (interj.) ay (interj.)     pussy (n., v.) päsi (n.) topple topul
91 beetle bit cue hide (v.) qoy- (v.)     put (v.) put- (v.) tor tärä
92 Belgi (adj.) Belgü (adj.) cull (v.) čul- (v.) hide (n.) qujqa     pyre bur- (v.) torah tör
93 bellow (v.) belä- (v.) culture kültür- (v.) hilarious (adj.) güleryüz (adj.)     quake bez- (v.) toss (v.) toš- (v.)
94 belt bel cup kap hit (v., n.) it- (v.)     quality qïlïɣ total tutuš
95 berm bürma curb (n., v.) kır hole (n.) ol- (v.)     quantity köni touch (v., n.) toqï (v.)
96 bestow bağıš- cure (fix, v.) kur- (v.) homeland xajmatläx     quarrel qaršï tower türma
97 bet (v., n.) büt- (v.) cure (food, v.) kuri:- (v.) hooligan qïčür- (v.)     quaver bez- (v.) trash (v., n.) tarıš
98 big (adj., adv.) big curd ko:r hoopoe üpüp     queen yeŋä tree terek
99 bill (v., n.) bil- (v.) curt (adj.) qïrt (adj.) horse arɣun     question kušku tremble (v.) četre (v.)
100 bill bilä- (v.) curve qarvï (adj.) host göster     queue trust dörs (t)
101 bitch bi curse qur- (v.) house quš     quilt kübil truth dürüst
102 black belä- (v.) curtain qur- (v.) how qalï     quim em tsk a click
103 blade baldu cut kes- (v.) howl (v.) yel     quit (v.) ket- tuber tü:b
104 blend bulɣa- (v.) cytren (OE) xitren hurl ur- (v.)     quite (emph.) ked (emph.) tuck (v.) takın- (v.)
105 board batɣa dad dedä Goth. holon,
ur- (v.)     rate ruzi- (v.) tumulus tumlu
106 bode (v.) bodi dam da:m hut quš     ration (v.) ruzi- (v.) turkey turuhtan
107 bodega butïq damp (adj.) dymly (adj.) I (arch. ic) ič (es)     regal (adj.) arïɣ (adj.) turf ter- (v.)
108 body bod dash (v., n.) taš- (v.) ideal (adj.) edil (adj.)     robe rop turn (v.) tön (v.)
109 bog bog dash (n.) taš- (n.) idle ytla     -s (pl.) -z (morph.) twat tat
110 bogus (adj.) bögüš (adj.) day dün idyl (adj.) edil (adj.)     -'s (poss.) -si (morph.) twist (v., n.) tevir-
111 boil bula- (v.) dawn tang (taŋ) ignite (v.) yaq- (v.)     sack sak udder ud
112 bold palt dear (adj.) terim (adv.) ilk ilk     sag (v.) sök- (v.) uh yah (interj.)
113 bong böŋ deem (v.) demek (v.) in (prep.) in (n.)     saga savag- (v.) ulan oglan/ulan
114 boot bot deep dip inch (n., v.) ınča (adj.)     sagacity sag ululate (v.) ulï- (v.)
115 booze (v.) buz (v.) deliver (v.) döle- (v.) inn (n.) i:n (n.)     sage sag un- an- (morph.)
116 bore (v.) bur- (v.) delve (v.) del- (v.) -ish čà/čä     sail (v.) salla (v.) unite (v.) una- (v.)
117 borrow (v.) borğ dementia dumur itch (v., n.) êichi (v.)     salary salɣa (v.) under (adv.) aŋıttır- (v.)
118 Boris böri derma deri itinerate (v.) ïd- (v.)     saldo salɣa (v.) undies andarak
119 boss boš (adj.) derrick terek jabber (v.) gap- (v.)     sale sal- (v.) until (Prep., Conj.) anta
120 botch (v.) bud- (v.) descend (v.) düšen (v.) jack (v., adj.) cak- (v.)     saliva liš us (pronoun) ös (pronoun)
121 bother (v.) busa:- diadem didim jag čak(k)     sallow (adj.) sary (adj.) use (v., n.) tusu (v., n.)
122 bouillon bula- (v.) dick dık- (v.) jaggery yaɣïz (adj.)     sane san- (v.) usher (v.) üšer (v.)
123 bound (adj.) baglandi (adj.) dike dık- (v.) jam jem     sanity san- (v.) vacate (v.) evük- (v.)
124 boutique butïq din tîŋ jar jart     sanitary (adj.) esan (adj.) vacuum evük- (v.)
125 bow boq- (v.) dingdong daŋ doŋ jar (v.) jar- (v.)     sapient (adj.) savan (adj.) valerian pultäran
126 box boɣ dip dip jaw jaŋaq     sapphire sepahir vampire ubyr
127 boy bo:y divide (v.) dil- jeer (v.) jer- (v.)     sari sarïl (v.) vat but
128 brain beini division dil- jelly yelïm     sash saču: Vesen (Gmn.) “bran” pečen
129 breath bu:r divvy (v.) dil- jerk (v.) jul (v.)     satisfy (v.) satsa- (v.) voe uvy (interj.)
130 brother birader do tu- jig (v.) jïq (v.)     satyr satir vouch (v.) buč- (v.)
131 bruise (v., n.) bürt, bert dog dayğa:n jig (n.) jig     savant savčï (v.) voucher vučuŋ
132 brute börü doll döl jog (v.) jag (v.)     savory saɣur (v.) wake vak
133 bucket but don (v.) ton- (v., n.) joke elük     savvy savan (adj.) ware tavar
134 bud buqüq drag (v.) tar- (v.) jolly (adj.) yol     say (v.) söy- (v.) was var- (v.)
135 bulge (v., n.) beleg (n.) dumb (adj.) dumur journey jorï (v.)     scare qor watch (v.) aɣtur- (v.)
136 bull (edict) bel dune dun judge ayg- (v.)     scatter (v.) ta:r- (v.) wax avus
137 bull (animal) buqa dung tuŋ- (v.) juice     schabracke (Gmn.) cheprak we (pron.) ös (pron.)
138 bunch (v., n.) bunča (adv.) durable dür- (v.) junk ček     sconce quč- Wermut (Gmn.) armuti
139 bundle (v., n.) bandur- (v.) duration dür- (v.) jut (v.) jalt (adj.)     sea si whelp bala:p
140 burg balïq duress dür- (v.) keen (adj., v., n.) qïn- (v.)     seat čıj- (v.) while (n., v., conj.) äwwäl (adv.)
141 burl burnï dust doz keep (v., n.) kap-     secede ses- (v.) whip yip
142 bursary bursaŋ dye dawa Kent keŋit- (v.)     second eki wife ebi
143 bust basta ea (OE) aq- (v.) key kirit     secret soqru wig yü:g
144 butt büt earl yarlïqa- (v.) kick (v., n.) kik- (v.)     sector chektür wiggle (v., n.) ügril- (v.)
145 cab qab/qap early (adv.) ertä- (adv.) kill (v.) öl- (v.)     see süz- (v.) will (v., n.) bil- (v.)
146 cabbage qabaq earn (v.) ar- (v.) kilter kel- (v.)     select seč- (v.) wise vidya
147 cackle (v.) kakla- (v.) Earth Yer kin kin/kun/kün     sell (v.) sal- (v.) wlita (face, obs.) bet
148 cadre kadaš eat (v.) ye- (v.) kind (adj.) keŋ (adj.)     sepia sepi- (v.) wolf börü
149 caginess qïjïm eave ev king kengu     seize (v.) sız- (v.) woo (v.)
150 cairn kayır eke eken (v.) kitchen qatna-     seizure sïzğur- (v.) world àbïl
151 cake kek elbow el knack qan- (v.)     sever (v.) sevrä- (v.) wormwood armuti
152 call qol eligible (adj.) elïg- (v., n.) know (v.) köni (v.)     sew (v.) suk- (v.) worse (adj.) uvy (interj.)
153 calm (v., n., adj.) kam- (v.) elite elga- (v.) laber (OE) “thistle” läbär     shade čadïn would (v.) 'yu
154 callous (adj., v.) qal (adj., v.) elite elit- (v.) lamb -la:n     shake silk- (v.) wrinkle (n., v.) burki:
155 calumny čulvu elk elik lame (adj.) ulam (adj.)     sharp (adj.) šarp (adj.) write (v.) 'rizan (v.)
156 can (jug) kanata ell el land elen < el     shatter (v.) ta:r- (v.) 'd (would) yu
157 can (v.) qan (v.) elm ilm language luɣat     she (pron.) šu (shu) (pron.) yacht yaɣ- (v.)
158 candle kandil en- (prefix, prepos.) en- (v., prepos.) laugh gül- (v.)     sheep sıp yah (interj.) yah (interj.)
159 cap kap endure endür- (v.) leader elit-     shield čyt (chyt) yard qur- (v.)
160 capture (v., n.) hapset enge (adj.) (OE) özak (adj.) leak liš     shilling sheleg yeah yah (interj.)
161 car köl- (v.) engine ïjïn- leather eldiri     shit (v., n.) šıč- yell (v.) yel (v.)
162 caragana qaraqan enigma tanığma less (adv.) es- (adv.)     shock (v., n.) šok- (v.) yep yah (interj.)
163 care qorq -er er (morph.) lie (v.) yalgan (v.)     short (adj.) qïrt (adj.) yes yah (OFris.)
164 carnival kerme Erbse (Gmn.) arpa loaf lavāš     shove (v.) sav- (v.) yet (adv.) yet- (v.)
165 carpus qarï Erik erk luck àlïč     sick (adj.) (ill) sık- (v) yield (n., v.) yılkı:
166 carve (v.) kert- (v.) equal (adj.) egil (adj.) lull (v.) ulï- (v.)     sick (v., n.) (vomit) sök- (v.) you (pron.) -üŋ (pron.)
167 case (box) kečä     lullaby balu baju     sicker (v.) sarq- (v.) young yangi:
168         might bögü:     sin cin (jin) Yule yol
169         make (v.) -mak     sinew siŋir youth (n., adj.) yaš (adj.)
170                 sing (v.) siŋ- (v.) yuck (excl.) yek (n., adj.)
171                 sink (v.) siŋ- (v.) yummy (adj.) yemiš (adj.)
                  sip (v.) syp (v.)    
                  sit (v.) čıj- (v.)    
Σ = 803 -39 +46

Etymological notes

Numerous Türkic roots are extremely productive, they engendered multiple series of derivatives. A selection of a keyword for productive stems is somewhat arbitrary, and likewise the selection of the contrasting derivatives is arbitrary, only to demonstrate contrasting semantics, Cf. cup, cap, cab, cabbage. Categorical breakdown is much conditional, since all words carry more than one meaning, applied in different contexts and frequently metaphorically, like the “daily bread” may not at all refer to either daily nor bread. It would be naïve to search for exact correspondence between any two figurative constructions. The abstract categories of Social, Religious, and Life tend to overlap in more than one dimension. Every category as a whole constitutes a paradigmatic transfer, an ensemble of thematic vocabulary transferred from the Türkic linguistic group to English; it is an indelible evidence of common genetic origin. Every effort has been made to make each entry of Etymological notes self-sufficient without a need to visit other entries, references are given only when the needed explanations are very extensive; that caused repetitive information scattered throughout the Etymological notes section, especially so when the entries are etymologically connected. When applicable, for reference, the Swadesh List number, Frequency rating and percentages are shown in parentheses: (Sw3, F37, 0.57%).

1. General, p. 28 2. Morphology, p. 35 3. Verbs, p. 37 4.1 Body, p. 65 5. Adjectives, p. 122 6. Other, p. 130
1.1 Pers. Pronouns     4.2 Dress, p. 71    
      4.3 Social, p. 74    
      4.4 Religious, p. 89    
      4.5 Commercial, p. 93    
      4.6 Household, p. 94    
      4.7 Cooking and food, p. 95    
      4.8 Animals, p. 96    
      4.9 Life, p. 97    

Tatsiz türk bolmaz bašsïz börk bolmaz
No Türks without aliens as no hats without head
Íåò Òþðîê áåç èíîçåìöåâ, à øàïêè — áåç ãîëîâû

(ÌÊ II 281)

Some common linguistic terms:
allophone - various acoustically different forms of the same words or phonemes in systemically related languages irrespective of the source language, e.g. “no, not”, Türkic ne, ni, Lat. non, A.-Sax. na, no, Welsh nid, Hu. nem, Gujarati na, nathi, Kor. ani, Jap. janai; allophone is a type of correlator, linguistically a word in mutual, complementary, or reciprocal relation
anlaut - first sound of a word or syllable
auslaut - last sound of a word or syllable
cognate - same words in different languages, presumably directly or indirectly descending from the same ancestor root
inlaut - middle sound of a word or syllable
lexicon - a set of words in the language
lexis - all meaningful word forms and grammatical functions of the language
morphology - practice of forming words
syntax - arrangement of words in sentences
synonym - different interchangeable words in the same language
translation - communication in another language of the meaning in a source language

Preliminary Note. In the context of this work, “cognate” is a word cited in the reference sources nearly exclusively limited to the IE etymology and thus exceedingly presumptive for the content of this work. The “cognate's” doctrinal precondition “descending from the same ancestor language” is a special pleading limitation. It burdens any study with an imposed doctrinal concept of a Family Tree model's single parental language. There is no evidence that a single “ancestor language” ever existed for any language. When they were first documented, all Germanic languages were amalgamated languages with a substrate and an adstrate, and treating them as single-sourced offshoots is methodologically faulty. Since the objective of this work is to establish ancestral connection, the term “cognate” is used provisionally subject to a critical review and examination. It may refer to any semantically and phonetically suitable candidate, and improperly to unsuitable candidates. The sibling words from different languages may stray quite far apart both semantically and phonetically. The term “allophone” describes the acoustically varied sibling forms equidistant from the same word of the long-gone vernaculars that used them in the prehistoric times, long before they were ever recorded. A typical Türkic word may have from as little as few to as many as as a dozen allophonic forms, some acoustically quite distant. A typical Etymological note lists IE cognates as a subset of allophones, complemented with a set of other allophones, e.g. for “big” the Gmc. cognates micel, bugge, and the non-Gmc. allophonic forms big/mig, bi:g/mi:g, ben, bog, min, mi:n, müg, pin.

For clarity, prefixes and suffixes may be parsed with brackets to show the stem.

1. General (few salient words supposedly present in any language) (32)

English and (conj.) “plus, also, too” (Sw204, F6 1.82%) ~ Türkic anta: (locative-ablative pronoun, adverb) “there, then”, “that, from there, from that”. Semantically, the Türkic anta is a locative synonym of the verb bul-, bol- “be”, a precursor of the Eng. be (Cf. Eng. heterogeneous be and is), and a notion “also”, expressed with available lexis. Ultimately, a derivative of an emphatic an “there, now, here” that serves as auxiliary adverb + directional suffix -da/-ta. The form anta: is one of the allophones of antaq, where the suffix -taq/-teq/-tiq is a derivative of a most popular and very universal verb teq- “touch, relate” expressing physical or analytical connection. Semantically, the Türkic anta is a locative synonym of the verb bul-, bol- “be”, a precursor of the Eng. be (Cf. Eng. heterogeneous be and is) and a notion “also”, expressed with available lexis. The anlaut vowel wobbles between a-, o- and u-, the second consonant wobbles between -t- and -d-. Distribution of the words an (ana) and anta: is truly Eurasian, from Fennic to OPers. and Indo-European, and to Manchu/Tungus and Korean. The innovation of the semantic extension is peculiar to the Gmc. branch. The A.-Sax. and, ant, ont, unt originally meant “thereupon, next”, essentially versions of “there, then”, semantically extended to mean “and, but”. The A.-Sax. form, in turn, is speculated to originate from a “PG” *unda, a transparent allophonic form of the Türkic anta. With the same credibility the origin can be ascended to the Proto-Fennic or Proto-Manchu. Cognates: A.-Sax. and, ant-, ont-, unt- “thereupon, next, and”, OSax. endi, OFris. anda, ONorse enn, OHG enti, Goth. anda “along”, MDu ende, Gmn. und “and”; Ir. ann, Gaelic ann, Welsh yna, all “there”; OLith. anta “on to”; Hett. anta “there, to that”, andan “to there, in this, in that”; LAv., OPers. ana “this”, and “here”; Buryat ene “this”; other linguistic families are noted in Sevortyan 1980, vol. 3 pp. 147, 149, 173, 456. Distribution covers entire Eurasian Steppe Belt with widely dispersed fringes. There is no need for the faux “PG proto-word” *unda “?” or “PIE” *en “in”. Under an PIE model, the ubiquitous Türkic suffix -ta/-da stands saliently inscrutable. The ken of the semantics for Türkic and Gmc. words, translated by a generous complement of semantic siblings, is essentially focused on the notion “next, following, also”. The antipode of “here” vs. “there” and “this” vs. “that” created semantic extensions signifying “opposition, against”, and “in front of, before, end” found in Gk. anta (αντα), anti (αντι) anten (αντεν); Lat. ante (prep., adv.), in derivative compounds starting with ante- (Cf. antechamber) and anti- (Cf. antibiotic); Skt. anta “end, border, boundary”; Hitt. hanti “opposite”. The southern vernaculars display an emphasis on “there” and “that” side. Phonetic and semantic similitude leaves no room to doubt the common and relatively recent genetic connection of the Gmc. and Türkic twins, that is amplified by the preserved original Gmc. meaning of the word. The Türkic version has an ablative shade conveyed by the suffix -ta, due to the expressed presence of the ablative syntax and morphology in Türkic languages. The Gmc. languages metamorphosed to shed the ablative morphology, and absorbed the remainder of the suffix into the root. Across Eurasia, a majority of the cognates have retained the ablative suffix -ta, and some may have retained its function. The “IE etymology” suffers a major bout of incoherence, ascending and separately from an unattested “PIE root” *en “in”, and ante- separately from an unattested “PIE root” *anti- “opposite, against”. The first “root” happened to reflect the attested Türkic root an-, an oblique case of and heterogeneous with bul-, bol- “be” with no relation to the etymology and semantics of in (prep.) “toward the inside”, which happened to be the a retained Türkic noun in “bottom, descent”, see in. The second is minimally adulterated Türkic anta retained with intact and truncated ablative suffix -ta. Truncation may be specifically attributed to the Sarmat vernaculars. The southern traces attest to the presence of the formed word before the Indo-Aryan-type migrations of the 2nd mill. BC. The absence of the formed word, and the consistent presence of the root an- in the Celtic languages attest to the existence of the root word in the 5th mill BC prior to the Celtic circum-Mediterranean migration. Taken together the quartet, the anta with the prepositions/postpositions in and on, and the directional suffix -ta, presents a salient case of paradigmatic transfer, attesting to a common origin from a Türkic linguistic milieu. The unattested “PG” and “PIE” words, naturally, had never existed. Notably, the word and is a major contributor to the English daily language, alone producing about 2% of the English ordinary lexicon. See be, at, in, on 1, on 2, till, to.

English big (adj., adv.) “large” (Sw27, F201, 0.08%) ~ Türkic biŋ (bïŋ, miŋ, minŋ, mïŋ) (n., adj., adv.) “big”, “much, big number, huge amount, intensity”. An IE etymology rates it a routine “of unknown origin”, “of obscure origin, possibly from a Scandinavian source”. A literary example for the allophonic boŋ cites identical form in the Norse: Khakass boŋ kiši: “big man”, Norse bugge “great (man)”, the o/u in Türkic languages are interchangeable. A synonymous “great”, the A.-Sax. great(ae), ascends to the Türkic gür “thick, dense, abundant, luxuriant” with about identical semantics. A synonymous “large”, the Lat. largus, also “of unknown origin”, ascends to a Türkic ular(-di, -g, -ɣ, etc.) “enlarge” fr. ula “join, unite, attach”. In Türkic, “big” also means “thousand”, like the Sl. “dark” and “fog” (Sl. tma (òüìà) and tuman (òóìàí) respectively) are semantic extensions of the Türkic tümän “ten thousand”. The Türkic big has numerous allophonic forms, most important is m/b alternation, biŋ/miŋ, the other forms are miŋ, mi:g, mi:ŋ, müŋ, beŋ, bi:ŋ, boŋ, pin, etc. The form biŋ/bi:ŋ is recorded in Ottoman, Türkü (Türkic Kaganate, pre-Uigur period), Uigur and OKirgiz. The form mig is recorded in Khakass, Horezmian, Chagatai, and Koman, encountered in English Turkisms. Which particular tribe(s) left their trace in English is impossible to suggest: outside of OKirgiz and Khakass, the three major entities were huge polities covering hundreds of ethnic tribes. Cognates: A.-Sax. micel (mikel) “great”, Norse bugge “great (man)”; Rum. bimbaše, bombašir “big (boss), major” (n > m); Bulg. bimbašyja, bumbašir ditto, Serb. bimbaša ditto; Alb. bimbaš ditto; Osset. min, ming  “big”; Gk. bimpasis (μπιμπασης) “major (big boss), bimpas”; Mong. mingaan, mingãan, mıngan “big”; Mongor. minhän “big”; references to Caucasian languages: Balkar, Karachai, Kabardin, Shaps, Abaz., Ubykh; reference to Ch. language. Distribution traces the Türkic trails, across Eurasia from Atlantic to Pacific. No IE etymology, but still funny suggestions of origins from “dwell”, “PG” *bewwuz “crop, barley”. The A.-Sax. micel (mikel) consists of mic-/mik- (~ mig/mi:g) + Türkic adj., adv. suffix -al/-el (-la/lä), “rare” in the eastern languages. The form comes from a linguistically separate source, possibly related to Khakass or Horezmian. Notably, both OKirgiz and Khakass refer to the same Enisean Kirgiz people at different stages of their history. An m/b alternation mic- > big bears some diagnostic trait. A triad “big”, “great”, “large” presents a case of paradigmatic transfer, attesting to a genetic origin from Türkic sources. See bunch, mickle, might.

English black (adj., v., n.) “pitch-dark” (Sw176, F751, 0.01%) ~ Türkic belä-, bele-, bula-, bulɣà- “smear, mar, soil, dirty”. A polysemantic primary verbal root bul- forms 9 notions, a verb belä- “smear” is a derivative of a secondary notion “mix”. The achromatic notion “smear” extends from extreme light to extreme dark, which leads to the opposite ends of a grey spectrum, Cf. Tr. bulut “cloud” (i.e. lit. “grey, smeared”), see cloud. A black shade is on one extremity and a white shade (e.g. bald, blanch, blank, bleach, bleak, blear, blaze, pale, etc.) is on the other end. The bifurcation was numerously noted within the IE philology but never properly understood nor intelligently explained. A noun/adj. notion beläk “black” and its dialectal siblings are formed with a deverbal noun suffix -k/-q. The root vowel takes form of -u-, -e-, -ï-, -i-, and -o-, creating a two-dimensional semantic/phonetic spectrum of randomly connected derivatives, Cf. bald, bleach, blitz, blur, etc. E. Sevortyan asserts a predating antiquity of the base root bul (EDTL, 1978, 252-3). Cognates: A.-Sax. blac, blaec “black, dark”, “ink”, “bright, shining, glittering, flashing”, and “pale, pallid, wan”, blaecce “black matter”, blaecfexede “black-haired”, blaechorn (e) “ink-horn”, OE blæc “dark”, ONorse blakkr “dark”, OHG blah “black”, Sw. bläck “ink”, Du. blaken “to burn” (charred black?); Lat. flagrare “to blaze, glow, burn” (?) (< fiamma “flame”, red tongues of fire); Rus. bulyndatsya (áóëûíäàòüñÿ) “blemish, mar”, balmosh (áàëìîø) “mixed up, tangle”, Serb. bulandisati “maroon”, Pol. bulany ditto; Osset. bulğaq “brush, jumble”; Kalm. bulngir, bulngr “dirty, unclean, muddy, mud, sediment”; Tung. bula- “swamp, bog”. Distribution spans Eurasia from Atlantic to Pacific, with a Gmc. isle in Europe. An IE notion of “burn” is etymologically unrelated to the notion “black”, i.e. burned meat made black and the like. Suggested “PIE proto-word” *bhleg- “burn, gleam, shine, flash”, *bhel- “blaze, bright”  are beyond contempt. The unattested fictitious PG and PIE *blakaz “burned” and *bhleg- “burn” are no more than mechanical spoofs of the attested Türkic root belä- and its Gmc. cognates without a hint on a morphological role of the suffixes -kaz and -eg. The contrasting content of Sl. beliy (áåëûé) “white” vs. Eng. black, amplified by a host of conforming derivatives, is noted by OED: “In ME. it is often doubtful whether... (it) means 'black' or 'pale'...”. A synonymous A.-Sax. sweart/swart/sweard “swarthy, black, dark”, swearttan “become, make black”, and wan/wann “dark, dusky, lurid”, wannian “become, turn dark-colored, black” attest to the amalgamated nature of the A.-Sax. language, where the old European sweart and wan were supplanted by the Türkic belä-. The replacement attests to a local demographic predominance of the Türkic lexis. In contrast, the old European A.-Sax. term hwit for the “white” had survived, attesting to a demographic balance contrary the situation with sweart. The form belä- and the notion “smear” carry a diagnostic weight, they are colloquial: belä- Tr., Uz., “smear” Kir., Kip., Kaz., Karakalpak, Tat., Uig., Tuv. “mix”. They point to an easterly origin extending in the west to the Itil (Volga). Such a scenario definitely expands the amalgamation period into remote pre-historical times. Notably, the word entered European languages not as a color or shade, but as a notion of a “mar” or “smear” leading to them, as an innovation peculiar to the Gmc. phylum with some random and distinctly different reflex found in the Italic Lat. That sets the date to prior to the Italic exodus from the E. Europe ca. 1500 BC, and excludes the Celtic knowledge of the word at the time of their circum-Mediterranean route from the E. Europe ca. 5th mill. BC. That is corroborated by the Celtic forms dubh/du (Ir., Welsh, Gaelic) for the word “black”, a cognate of the Türkic dawa:/dava “dye”. A phonetic and semantic concordance, the peculiarity of the contrasting derivatives, the peculiarity of the exclusively Gmc. European provenance, the absence of credible etymological alternatives, and indirect attestation of the Celtic examples provide convincing and vaguely datable evidence of a Türkic origin. See bald, bleach, blitz, blur, cloud, dye.

English child (n.) “youth, minor” (Sw39, F497, 0.04%) ~ Türkic koldaš, qoldaš (n.) “comrade, friend, fellow”. The word koldaš is a noun describing persons associated with in some way, fr. ko:l “arm, upper arm”, it means “comrade”, lit. “one with whom one links arms”, and applies to the court pages and servants from noble families. The terms for front legs and their parts in Türkic and Eng. are identical for animals and humans, unlike e.g. Rus. or Tung.-Manchu. According to M. Kashgari I 461, “the word (koldaš) is used only between servants of notables”; i.e. it is a word from a category of “upper class word”, like, for example give “bestow”. Cf. Rus. eidetic term deti boyarskie (äåòè áîÿðñêèå) “Boyar's children” vs. Sl. otrok (îòðîê) and rebenok (ðåáåíîê) “youth, child”, see tad. The deti underwent identical social shift in meaning and application, expanding from a specific to a generic noun. A devaluation of the term and its adoption across a society is alike in both cases. Cognates: A.-Sax. cild, childe, pl. cildru (with k-) “youth of gentle birth”, Dan. kuld “children of the same marriage”, OSw. kulder “litter”; Pers. qolčaq “bracer” (< Tr.); Mong. qolturi-, qolturqai “handicapped arm”, qol- “armpit (< qoluŋ(gusu) “armpit stench”, < Tr.), Khalk. holtoroh “handicapped arm”; Tungus-Manchu xoldo “side (left, right)”; references to Balkan Sl. languages. Distribution for the base “arm” spans the width of Eurasia, derivative “child” spans Steppe Belt with an isle in the European NW. The IE etymology asserts that it does not know any “certain cognates outside Germanic”. In a circular logic it then deduces child fr. A.-Sax. cild, childe “fetus, infant”, dropping crucial “gentle birth”. Then it appeals to unrelated Goth. kilþei “womb”, inkilþo “pregnant”, while the A.-Sax. native “womb” is hama, fahame, cildhama, fea-hama, feorh-hama related to haeman “intercourse, cohabit, marry”, vs. Goth. kilþei, but Cf. Goth. derisive kalkjo “harlot”. The OE interpretation “fetus, infant” in the context of the social function is an open manipulation. The IE etymology's reference to the girls is also out of kilter. It conflicts with the known institute of the male pages and service boys: female companions were next to invisible, and in contrast the page boys, dubbed as bodyguards, courtiers, etc., were groomed to be army commanders and a ruling elite. An ascent to the root gen- is tempting but conflicts with the attested historical sequence and is unattainable phonetically, see gene. The social nature of the terms koldaš and child, the repeated emphasis of independent sources separated by a half-continent distance, on their noble origin and identical function, the accent on the noble descent of its carriers makes incredible a thesis that the term is a derivative of the words like “womb”: everyone comes from the womb, but very few are distinguished by nobility. The same with the genesis nature of gen-. None of the cognates offer an IE origin, the word cild (with k-) and its allophones are lucid loanwords within the IE family, their origin came from elsewhere. The search for etymology of the word child is marked by parochialism, it inevitably had to produce flimsy results palatable only to the believers. One language that penetrated all corners of the Eurasia has not been explored, probably because of a subliminal combination of self-supremacy vs. insignificance typical for confined mentality. A diligent search would definitely have produced less tenuous results. This is not a question of expanding the search filed, it is a question of following precepts of scientific research. The phonetic contraction of -aš for an assimilated word, the fluidity of the vowel -i-/-o-/-u-, and the alternation k-/ch- are routine. The actual semantic drift is well documented. The word child illustrates a wider aspect, a tackling of etymology should expect not entirely predictable ingenuity, it calls for a cast beyond mechanical manipulations in a search for nearest phonetic and semantic siblings. At the ridges of blending cultures, interaction of the linguistic nature and environmental nurture creates lexical innovations. See gene, give, tad.

English dawn (n.) “sunrise” ~ Türkic daŋ, dang, taŋ, čaŋ (n.), taŋla:-, daŋla:- (v.) “dawn”. Sunrise had a primary role in Türkic societies, it was a morning prayer and a new day. It carried a daily flavor of awe. A.-Sax. had an extensively developed lexicon for “dawn”, attesting to an old age of internalization. Cognates: A.-Sax. dagian (v.) “dawn”, dagung, daegred “dawn, daybreak”, daegredwoma “dawn”, daegredleoma “light of dawn”, ONorse dagan “dawning”, Dan., Sw. dagning (v.) “dawn”, Gmn. tagen (v.) “dawn”; Ch. dan, dang (旦) “sunrise, morning”. Distribution extends from Atlantic to Pacific. Both an IE family and a PG are innocent of originating the word. No references to Tungus-Mong. cognates. Whether Türkic inherited the word from Ch., or Ch. inherited it from a Türkic Zhou is a mute question. For a 3-phoneme word the coincidence may statistically be not overly impressive, but in IE etymological context it is ominous. It moves any claim to IE or Gmc. origin to a category of lunacy, and sends any and all related “reconstructions” to a dust bin of the IE theory. The only reasonable link connecting Gmc. and Sino-Tibetan languages is an overreaching mobility of the Türkic languages. Aside from a Forrer's unstated surmisal about Türkic being a substrate component of the Gmc. branch of the IE family, that would need an unprejudiced stamina to absorb. Considering that SE Asia had its own motley path of peopling totally isolated from that of a Middle Eastern path, that lexical continuity should have inspired some inquisitive thinking. There was a total absence of biological genetic connections between Old Europe I Y-DNA, Chinese O2, and Türkic Kurgans' R1a/b. The Türkic dün ~ Eng. day ascend to the same stem tang/taŋ/toɣ “dawn”, the pair constitutes a case of paradigmatic transfer attesting to a well established demographic convergence, linguistic amalgamation, and to an origin from or via a Türkic phylum. See day.

English day (n.) “daytime, 24-hour interval” (Sw178, F183, 0.11%) ~ Türkic taŋ, daŋ, toɣ- (n.) “dawn, sunrise”, dün “yesterday”. Two allophones, toɣ- and taŋ/daŋ “dawn”, molded the European Gmc., Balto-Sl., and Romance forms. Day and dawn are two siblings of the same root, see dawn. The European forms for the “day” ascend to the concrete noun “dawn” and its notion of coming or having daylight. Cognates: A.-Sax. dæg “day”, OSw., MDu., Du. dag, OFris. dei, OHG tag, Gmn. Tag, ONorse dagr, Goth. dags; Balt. (Latv.) diena > Slavic den; Lat. dies; Ch. dan, dang (旦) “sunrise, morning”. Distribution extends from Atlantic to Pacific. Like in a case with “dawn”, both an IE family and a PG are innocent of originating the word. No references to northern Far Eastern cognates. Whether Türkic inherited the word from Ch., or Ch. inherited it from a Türkic Zhou is a mute question, but numerous prongs of the “Scythian” nomads moved to the Far East. For a 3-phoneme word the coincidence may statistically be not overly impressive, but in IE etymological context it is ominous. It moves any claim to IE or Gmc. origin to a category of lunacy, and sends any and all related “reconstructions” to a dust bin of the IE theory. The only reasonable link connecting Gmc. and Sino-Tibetan languages is an overreaching mobility of the Türkic languages. Aside from a Forrer's unstated surmisal about Türkic being a substrate component of the Gmc. branch of the IE family, that would need an unprejudiced stamina to absorb. Considering that SE Asia had its own motley path of peopling totally isolated from that of a Middle Eastern path, that lexical continuity should have inspired some inquisitive thinking. There was a total absence of biological genetic connections between Old Europe I Y-DNA, Chinese O2, and Türkic Kurgans' R1a/b. Transition from Türkic toɣ- to Gmc. dæg, dag, tag, etc. is transparent, likewise is a transition from Türkic taŋ/daŋ to Balto-Sl. diena, den, etc. The Romance, Eng., OFris. forms follow either the -n-/-y- or -g > -y > zero alternation, routine for the Türkic languages. A transition of the Türkic labial vowels to diphthongs in the Baltic languages is systemic. The transition ü => ie is one such systemic transition, with subsequent reduction of diphthongs in Slavic languages, which is one of the diagnostic parameters for the direction of the linguistic substrate process: Türkic > Baltic > Slavic. Dialectal transition of the Türkic consonants to semi-consonant -y- is well documented, hence the Kipchak 13th c. form tayn, and the -y ending instead of consonant in numerous languages, including English, Cf. ig, iy “illness, disease”, dag, day. Another form for day in Türkic is kün, the word for sun, which is still active. The semantics of sun is preserved in Turkic “south”, “midday”; reflexes of the Türkic kün with dialectal k/d alternation are preserved in Skt. dah “to burn”, Balt. (Lith.) dagas “hot season”, OPrus. dagis “summer”. The presence of the Skt. cognate, and the presence of both still active forms in Türkic indicates that the split into kün/dün versions happened before the eastward march of the Aryan agriculturists crossing the N. Pontic steppe ca. 2000 BC. The closest to the NE European forms is the form dün meaning “yesterday” in Az., Osm., and Tkm. (düyn) languages located in the southern end of the Sarmat area, and in the Central Asian Tuvan language that may have re-borrowed a Mong. form borrowed from the eastern Huns at the turn of the eras. The pair “dawn” - “day” constitutes a case of paradigmatic transfer attesting to a well established demographic convergence, linguistic amalgamation, and to an origin from or via a Türkic phylum. See dawn.

English Earth (n.) “earth, Earth” (Sw N/A, F987, 0.01%) ~ Türkic er, yer (n.) “earth, place”, erdä, yerdä “on earth, in place”. The Türkic er “man” must be an archaic semantic relative of the er “earth”, or vice-versa, like a “man” and his “place”. A spectrum of phonetic allophones is scattered among Türkic languages: yer, yar, jer, djer, cher, chir, dier, sar, ser, tier, ker, they offer plenty of choices for the daughter forms, Cf. Ir. cre, uir. A Sum. yer attests to an origin predating 4th mill. BC. At one time, around 4,000 ybp, the word er “earth” spread truly international, it is native to the Old Semitic, Sumerian, Germanic, Tungus, and all Türkic languages, covering a space from Mediterranean to Korea. The later Semitic cuneiform records put it in female gender with a suffix -at, pointing to a primeval notion of a “man” as a “woman”. An anlaut consonant probably was not a later prosthesis, but represents an original archaic form of the Ogur languages. With consonants g- and d-, and semi-consonants j- and y-, ger/der/jer/yer, it was also recorded in the Scythian Oguric form Gerra “the land of our ancestors”. The Oguz branch lost the anlaut consonant: Oguz and Gk. er. Cognates: A.-Sax. ear, eard, eorð, eorðe, heorð “earth”, Gmn. erde, from the Türkic root er (yer) which produced Gmn. noun ertho, and ultimately Gmn. erde, Du. aarde, Dan. and Sw. jord, and English earth; Ir. cre, uir, Welsh ddaear/daear “earth”, erw “field”; Lat. terra; Gk. er, era, eraze “on the ground”; Skt. thira; Hu. szer(å) (/sere/); Basque lurra; Arm. erkir (երկիր) “earth”; Arab., Maltese art, ard; Sum. yer “earth”, Akkad. yersat; Pers. yer, arz, arazi (pl.) “earth, lands”; Kurdish yer; Kuchaean (“Tocharian B”) yare “gravel”; Tungus, Manchu, Even yerka, jerke “earth, world, universe”; Cuv. sar; Telengut ser “earth”. Distribution on Eurasian scale, with spills to Europe and far East. A myopic IE etymology is funny, it limits its scholarly horizon to “IE cognates” and offers a 3rd mill BC faux “proto-word” for an attested 4th mill BC ancestor. The Common Gmc. form Erde is peculiar, supposedly -de is a female gender marker under an incompatible typology, and the marker was preserved after another creolizing metamorphose to an analytic typology, becoming a tautological die Erde with a female form of an article and an archaic gender marker suffix and eorðe. Rather, the -de is a Türkic locative suffix appropriate for a reference to a place, q.v.. The anlaut consonant common to the European reflexes in Welsh, Basque, and Lat. probably reflects an early form brought over to the European Iberia 4800 ybp by the circum-Mediterranean Celtic Kurgan migrants from the Pontic steppes. By that time, Kurgan waves had ingrained in Europe their continental allophones. The Skt. reflex comes later from the Pontic steppes to India ca. 3600 ybp. Another migration path from the Pontic steppes was direct to the Balkans and C. Europe connected with the Kurgan migrations. The forms with the anlaut t-/d- ascend to the Türkic ter “pasture land”. The duo of er “earth” and er “man” constitutes a case of paradigmatic transfer, indelibly attesting to a genetic origin from a Türkic linguistic milieu (A. Dybo 2013, 219). See -er, ore, terrain, turf.

Numerals – Preliminary Note. The IE ordinals vs. cardinals for numerals one and two are suppletive. The duality does not have an accepted explanation. The origin of the ordinals for the pair is deliberately excluded from the IE base vocabulary, and any suggested etymologies are distinctly unrealistic. At the same time, the nearest cognates for one and two are geographically close neighbors of the Athens and Rome, they are the Türkic bir “one” and ikki “two”. It would be both prudent and wise to succumb to reality and explore linguistic situation as it comes rather than as it ought to be.

English first (adj., n., adv.) “ordinal of one, initial, beginning, preceding” (Sw N/A, F171, 0.09%) ~ Türkic bir (n.) “one”; bir, birinč “first”. Suffix -inč consists of a selective suffix -in, and corresponds to the Eng. ordinal suffix -th, Cf. tenth. An older Türkic suffix -inti ~ -inch uses suffix -in and ordinal marker -ti that corresponds to the suffix -th, Cf. ekinti “second”. In different languages -ti/-č is reflected with forms -th, -st, -t, -sht. Of 44 European languages, 32 (73%) use allophones of the bir, matching a level of the European 50.6% R1a/b demographic presence in Europe. The other 12 languages use 7 native words. Cognates: A.-Sax. aerest, fyrst “first, in front of”, for “before”, OSax. fuirst, OFris. ferist, ONorse fyrstr, Dan. første, Du. vorst, OHG furist, all “first”, MDu vorste “prince”, Gmn. fürst “prince”; Balto-Sl. (Lith.) pirmas, (Latv.) pìrmais, Sl. pervyi (ïåðâûé) “first”, also Cf. teper òåïåðü) “now”, Serb. birzeman “once”, birden, birlija “unity”, Bolg. birebir “unilateral”; Lat. primus “first”, prae “before”, It., Sp. primo; Gk. protos (πρωτος) “first”, pro, paros “in the past”; Pers. bir “one”; Skt. pura “at first, in the past”, Av. paro “in the past”; Heb. (Yidd.) ersht (ערשט), Heb. reshith “first”; Malay pertama “first”; Hu. első and Afrikaans eerste (elided initial b-/p-/f-). Distribution uniformly lines up as appendages to the length of the Eurasian Steppe Belt, covering a space from the European Atlantic to the Kamchatka Peninsula on the Pacific. IE languages have a spectrum of genetically unconnected stems for cardinal “one” (Cf. one and ek), but the ordinal “first” is fairly universal. A motley origin of the cardinal “one” forced IE theoreticians to exclude “one” from a small complement of the “PIE proto-words” that constitute a PIE theoretical “base vocabulary”. European forms for the notion “first”, other than those derived from the Türkic bir, tend to derive not from the native cardinal “one”, but from the native words for outset, beginning, and the like, attesting that the native Sprachbunds did not have a word for “first” (Cf. Basque bat “one”, lehen “first”). A paucity of synonyms for the secondary notion “before” attests to the great lacuna in expressing that notion. In addition to first and bir, both Türkic and Eng. have numerous synonyms to express “first”, in both cases the main metaphor is head (Cf. headmaster). Both languages use the word ilk for a prototype, with modest semantic difference. IE etymology ascends the IE cluster for the ordinal “first” from a preposition per with a basic meanings of “forward, through”. That is an artificial non-viable construct in inverted sequence that derives a base word from its derivative. That negates the IE thicket of pretentiously ancestral fore, the “PG” *fur, *fura, *furi “before”, *frestaz, *fristiz, *fresta “date, appointed time”, “PIE” *per-, *pero-, *pres- “forward, beyond, around, forth, over”, *prae-  “in front of, before”, *pre-isto- ditto. A base cardinal “one” had to appear first, it is sorely missing in that mental enchilada,. A common attested Türkic bir with areal articulations as for, fyr, pr-, etc. turns fantasy into reality. The ordinal “first” was used as pre- and post-positions (Cf. A.-Sax. forwost “first, leader, head”) that have developed into prefixes in languages with typology that allowed a contamination of the roots with prefixes, i.e. did not rely on the primacy of the root in a derivative. The vowel-initial forms aer “ere”, Heb. ersht (ערשט) and reshith (metathesized version) etc. elided an initial consonant. A decent fr. the Semitic ahad, awwal must be excluded. Except for few metaphoric expressions, practically all IE forms for the notion “before” ultimately ascend to the Türkic cardinal “one”. European languages have developed numerous derivatives and applications using cognates of bir “one”, attesting to a long history of internalization, further amplified by the innovations of the prefixes pr-, pre-, pri-, and the like. As a result of Indo-Europeanization, first of the European languages, and then of the internationalization of the European vocabularies, these words occupy a prominent place in the dictionaries of many languages of the world. The degree of internalization by the time of the initial written records attest to exceedingly deep roots. The phonetic and semantic differences attest to numerous and independent paths. The shared words between the European and Asian IE fractions attest to the presence of the source word prior to the ca. 2000 BC Indo-Aryan migration from the Eastern Europe to Asia. The absence of the cognates in the Celtic languages dates the emergence or spread of the Türkic form by the time later than the Celtic departure from the Eastern Europe to Iberia ca. 5th-4th mill. BC. These terminal dates point to dissemination of the word during Kurgan migrations of the 4th-3rd mill. BC. That also allows a provisional dating for the -ti/ transition, becoming a dating indicator for the Türkic/Germanic separation, since the Gmc. fraction retained the Türkic -ti form as -th and -st, and the Gk. retained the form protos with the original suffix -t. Dissemination realistically explains the suppletion of the IE ordinals vs. cardinals, solving the insurmountable puzzle of the IE linguistics and connections between the Türkic and IE cardinals. The paradigmatic transfer of the complex first, second, and ilk, and of the integral morphological elements unimpeachably attests to their Türkic origin. See ilk, Numerals – Preliminary Note, prior, second.

English land (n.) “ground, grounds” (Sw N/A, F1458, 0.01%) ~ Türkic elen (n.) “certain land, possession, its people”, ellä-, illä- (v.) “unite, confederate”. Incoherence and incredulity of the IE etymology is barely masked by excessive chatter. An ultimate origin comes from a base notion el/il “tribe, people” and its overlay “master, ours, rule (v.)” from a base notion el “arm”. Form ıl- (with ı-) “descend” points to a notion “descent from progenitors”. The forms el and il have diverged into phonetically and semantically overlapping ethnic lines, attesting to primeval origins. Chains of the earliest notions “tribe” > “confederation” > “people, land” > “country” > “state”, and “peace” > “concordance” > “tranquility” defines “land” in a context of the notion “people” as an attribute of “country” and “state”. The notions “country” and “people” are interchangeable. The root el/il is generic and polysemantic, it forms an array of notions each equipped with its array of meanings from “tribe” to “quiet”, etc. Subdividing overlapping el/il (n.) into separate lines el and il mars the issue: articulations are local. Flavors of the base notions are carried by a vast array of derivatives. The elen is an inflexion of el/il (n.) “land, country, realm, people” to convey a relation. A wealth of attested phrases reflect elen = “land”: elin arturu “conferred land” (artur “give, confer”, see Arthur), bodunuŋ elin törüsin “ruled people of the land” (törü “law”, see torah), elin etti, beklädi, tüzdi, törüsin “prepared, strengthen country, people”, elin saqladï “saved country, people”, elin qoδtï “left the land”, etc. (EDT pp. 58, 109, 111, 169, 171, 172, 389, 487, 570, 602, 665). An attested verbal form iləndi-/ilindi:/ilinti- “landed, entrenched, hooked, ruled (a beachhead, land)” is a second, largely interchangeable form ilən = elen = “land”. Of 44 European languages, 13 (30%) use their own 16 different terms, 11 (25%) use Balto-Sl. zem- “low”, and 20 (46%) use off-Türkic terms tera “valley, lowland” and elen “land” approximating a level of the European 50.6% R1a/b demographic presence in Europe, see terra. An unaccented anlaut vowel e-/i- elided at early Gmc. internalizations. Cognates: A.-Sax. land, lond “earth, land, soil”, inland “demesne”, landwela “possessions” (+ 5 dozens more), WFris. lan “land”, ODu. lant, Du., Norse, Sw., Icl., Gmn., Goth. land “land, country”; OIr. lann “heath”, land, Scots laund “land” (+ fearann), MWelsh llan “open space”, Welsh “enclosure”, Breton lann “heath”; OCS ledo (ëѧäî), ledina “heath”, Cz. lada “fallow land”; Fr. lande “land”; Gk. Hellada (Ελλαδα), Hellas “Greece”, ellinikos (ελληνικος) “Hellenic” (< Tr. el; +Tr. elläš- (v.) “accordant, peaceful”), Hellene “Greek, ethnic or of Greece country” (< el); Alb. lëndinë “heath, grassland”; Mong. el “peace”, else- “peaceful”, elči “envoy, peacemaker”, ulus “people”, elde- “handiwork” (< el), ili-, iləh, ele-, ile- “canoodle, caress, stroke, rub (< handiwork)”, Kalm. el “peace, concordance, tribesman, familiar”, Evenk ılel “people, public”; Tungus elgemnı “leader” (< el), elke “quiet” (~ “peaceful”), elket “cautious”, Manchu elhe “familiar” (~ “peaceful”), Nanai elžini “leader” (< el); Kor. eruda, elluda (v.) “lead”. Distribution is ubiquitous from Atlantic to Pacific, a Gmc. isle is a European NW fringe. The IE etymology for “land” does not exist. On one hand it rightly “suspects a substratum word in Gmc.”, but on another hand drowns in uncouth origins. The letter are faux and unrelated PG *landja- (“?”), PIE *lendh- (2) “hip, kidney”, attested A.-Sax. hland, hlond “urine, lant (aged urine)” ~ PWG *hland < PG *hlanda “urine” < PIE *klan- “liquid, wet ground”, and some more of the likewise pearls. Eng. “land” has eidetic forms in all other Gmc. languages. El/il in “Île-de-France” is traditionally held as “Island-of-France”; but for a landlocked territory with a river in its center splitting it into two half-pies the “island” does not make sense, “France land” is more suitable. The Gk. terms are rated as “words of unknown origin”, but they make perfect sense in Türkic, and are supported by an ancient Türkic-Gk. symbiosis (Herodotus IV 108, 109, 120). The Gmc. form “land” is eidetic to the Türkic form. The form Ellas is a reflex of the verb elle:- “peaceably incorporate into the el realm”, a deverbal noun for the Greek settlers in the N. Pontic Scythian lands, Cf. Herodotus q.v.: “Gelons were Greeks from long ago, they speak partly in Scythian, and partly in Hellenic”, they belonged to the populace called Budini, see body. For the “land”, Greeks had their own term, they did not have to borrow it. The elision of the anlaut vowel e- and the presence of the auslaut suffix -d are morphologically explainable effects, Cf. bilge:d-, Kashgari II, 340. The anlaut vowel e- may have been an early Oguz peculiarity in articulation of the initial liquids, in contrast with the Ogur articulation that either did not add an initial e- or covered it with a consonant. A targeted study of Ogur linguistics has not been yet initiated. See Arthur, body, terra, torah.

English man (n.) “person, human, male” (Sw37, F128, 0.15%) ~ Türkic man (n.) “man, men, mature sheep”, man- (v.) “go, move”, manlai “forehead, scout, front line of army”, mãn(ã) (Chuv.) “adult” . An IE etymology offers a “probably” inexplicable primordial “man”. Ultimately fr. a Türkic pronoun root men/min “I, me” and a postpositive pers. marker in nominal and participle compound predicates. Although the form man, maŋ has developed a vast semantic array, the ones connected with aspects of a notion “man” - size, strength, maturity - are present in all applications. Türkic has a quartet of terms for “man”, with different distribution levels and semantic shades: adam (casual), er (muscular), man (stranger), and kiši (neut.). As a “man”, a noun man is attested in the Caspian-Aral basin, Cf. Azeri bir man “a man”, man gəldi “man (has) arrived” (~ kel- “come”, see Celt) vs. bən gəldim “I arrived” vs. Turkish adam geldi (adam “man”, see Adam). That is consistent with numerous other Horezmian (Turanian, Turkmenistan) Türkic-English parallels. As an instrumental noun suffix, -man can form both animate and inanimate derivatives. Of 44 European languages, 15 (34%) use allophones of “man”, 2 (5%, Ir. and Hu.) use allophones of er, thus 17 (39%) use off-Türkic roots, approaching a level of the European 50.6% R1a/b demographic presence in Europe. 9 (20%) languages use allophones of Romance hom-, 7 (16%) use Sl. chelo- “face”, 2 groups of 2 (9%) languages use their own different words, and 7 (16%) others use their own 7 different words. Cognates: A.-Sax. man “man”, meh, mec (pron.) “me (1 pers. sg.)”, min, mine (pron.) “my, mine”, OSax. man “man”,  Fris. man, ONorse maðr, OHG, Sw., Du. man, Dan. mand, Goth. manna, Icl. maðr, maður, menn (pl.), Gmn. mann, “man”; OCS mjj (ìѫæü) “man, husband”, Sl. muž, muj (ìóæ), mujik (ìóæèê) (dim.) ditto, moy “mine”, “us”; Lith. žmogus “man”, žmones (pl.) “people”; Arm. mard (մարդ) “man”; Kurd. mer “man”; Pers. man “I, myself”, Taj. mard “man”, Pahlavi ma:n “us”; Sogd. man “me, myself”; Fin. mies, Est. mees “man”; Gujarati mena (પુરુષ) “man”; Skt., Av. manus, manu- (मनु, मानस) “man, human”; Evenk., Negid., Ulch., Udegei bi “I, myself”, bu “us”; Ch. ben (本) “I, myself, personally” (m/b alternation, ~ Türkic ben/men “I”), another one of the peculiar Eng.-Türkic-Ch. coincidences. In addition to the above far from exhaustive listing, most above languages have developed derivative lines, some of them most extensive, Cf. Eng. manful, manly, manhandle, manhole, one-upmanship, + many more. Distribution extends from Atlantic to Pacific, way beyond a Gmc. reach. No etymological IE etymology. In addition to “probably”, it runs in myopic circles with a faux “PG proto-word” *mann- and “PIE proto-word” *man- or *mon- “man”. Either one is not too enlightening etymologically. With a straight face it also cites an expert historian Tacitus with his folk Ìannus - a progenitor of the Germans and Manis (Μάνης) - a progenitor of the Phrygians. In his time (ca. 100 AD) Tacitus have not known of a legendary progenitor of progenitors, a synonymous Adam “man”, and could not have enlighten his audience on the Genesis story. The entire case of myopic etymology is due to be gracefully retracted. A.-Sax. had 13 ways to express “man”, of which at least 4 were metaphorical extensions (healer, match, herdsman, producer), 4 were Türkic cognates (beorn ~ ber- “bearer”, carl ~ kür “courageous, manly”, man ~ men “man”, weor, wer ~ er “man”), and 6 belonged to the European native languages (guma, gome Cf. Italic homo, human, leod Cf. Gmc. liut “people”, maecga “match, companion”, rinc “rink”, secg “segge”, waep “male”). The breakdown illustrates the amalgamated nature of the A.-Sax. language inherited by Eng. In Eng., like a postposition in Türkic, man also serves as an suffix of a noun, as in workman, serviceman, with some peculiarities, for example alternation man/men to indicate plurality is impossible in Türkic agglutinative languages, it is an Eng. innovation. The Sl. mujik was formed with a Türkic diminutive suffix -k reflecting a subordinate status of the object. Since the notion “man” also dubbed as a “warrior”, numerous dictionary words expressing “warrior” also dubbed as “man”. Compounds of the type object + man are standard in both Türkic and English, Cf. A.-Sax. ancorman, chapman, hierdeman, and the Tr. ataman “leader”, lit “father-man”, dušman “enemy”, hetman (getman) “title”, müsülman “Moslem”, etc. In all these instances, the suffix -man is definitely not “I, me”. A standing IE objection is that the “man” and “I, me” are semantically incongruent, but the actual practice of the IE etymology more than regularly allows much wider and often incongruent semantic fields than this (e.g. see -like q.v., references “body”, corpse”, etc.). Distribution of the Türkic part -man is clinal, the western areas tend to form concrete agent derivatives, while the eastern areas tend to retain archaic generic function. The split points to some tumultuous events of the late 3rd mill. BC that split the Eastern European farming population, drove some of it southeastward, and some of it westward. Nowadays those migrations are traceable by genetic dating. The synonymous trio adam, man, er “man” constitute a case of paradigmatic transfer, indelibly attesting to an origin from a Türkic phylum. See Adam, -er, Celt, -like.

English no (nah, nay, neither, nope, nor, not) (negation part., interj., adv., adj.) “not” (Sw16, F18 1.57%) ~ Türkic ne, ni, no, nü, nor “negative, negation (emphasis)”, ne:ŋ “not any, at all”, ne:če: “nothing”, (part., adv., adj., n.). A homophonic ne, genetically connected with the negation ne, dubs as a base for interrogative lexemes. Taken as two semantic lines of a common ne, the negating ne is direct “no” (ne jorïmaz “not journey”, ne jatmaz “not located”, ne uδïmaz “not sleeping”) and emphatic “not this, not that” (= “neither...nor”). The interrogative ne is “what, how”, negative “how come”, and the like. The negative ne has developed a system based on combinations with reliable indicators, and as syntactic phrases, but frozen at incipiency. It parallels numerous other negation lexemes and morphemes (e.g. -ma- etc.), some of them reflected in Eng.: del- “delete”, den- “deny” Cf. synonymous Lat. negare “deny” (< Tr. ne), Fr.: pa, pas- “not”, Cf. pas comme, ne pas “not like, not” (pas < Tr. -ma-, b/m/p alternation). The Türkic negation forms probably are developments of primeval indiscriminate front vowels a:, o:, and u:. It was noted that rounded and back vowels are unsuitable for expressive negation (Clauson 1972, p.3). The negation part. ne, ni, no gained a wide circulation in Europe: of 44 European languages it was adopted by 35 (80%) languages. A runner up 2-language Fennic group takes 5%, and the other 7 (16%) languages use their own terms. With their long roots ascending to the first Kurgan waves of the pre-Corded Ware times, a share of the off-Türkic allophones more than matches a 50.6% R1a/b demographic presence in Europe. Syntaxes and morphology of the European languages freed the Türkic negation from a rigid syntax, allowing it to blossom with grammatical functions and semantics. The English no is used as noun, adjective, adverb, and interjection. Cognates: A.-Sax. na, no (adv.) “no, never, not at all”, nan “none, not any”, na... na “neither...nor”, OSax. nigen “not any”, OFris. nan, nen “no, none, not any”, Saterland Frisian naan, neen ditto, NFris. nian ditto, WFris. gjin ditto, OFris., ONorse, OHG ne “no”, Goth. ni, Sw. annu, Gmn. nein, Icl. nr; Ir. uimh, no, Scots nae “no, not any, none”, Welsh nid, dim; Latv, Lith. ne; Sl. net (íåò) “no”, Ru. nechego, nichto (íå÷åãî, íè÷òî) “nothing”; Romance no, Lat. non, nihil “no, not”; Gk. den (δεν) (Cf. Welsh dim); Hu. nem, meg; Pers. nah; Gujarati na, nathi; Kor. ani (아니) “no”; Jap. janai “no”. Distribution spans from Atlantic to Pacific. A majority of the IE languages use a Türkic form, affirming a Corded Ware melting pot as a cradle of the IE languages. Other than that, there is no common IE “no”. Attempts to privatize an attested Türkic word as faux “PG proto-word” *ne and “PIE proto-word” *ne, “PG” *aiwi-  and “PIE” *aiw- “vital force, life, long life, eternity”, PWG *nain, “PG” *nainaz “not any”, “not one” supposedly leading to ne, no, in favor of some parochial interests, or portray it as a European loanword to the opposite end of the world, are not sustainable. Such fictions should be tactfully retracted. The IE etymological fantasy can't be called etymology. It goes in circular logics no < na < no +a “ever”, and uses unattested PGmc. and PIE reconstructions to come up with natural allophonic variations. Still, at the end it is reverting to the old basic Türkic stem ne. For “not”, the Eng., like the Türkic, has numerous allophones and spellings. In the usage frequency rating, these allophones occupy a very prominent place (Table 1a): no (F18, 0.81%), not (F23, 0.74%), neither (F948, 0.01%), nope (F1496, 0.01%), -n’t (F1.79%) for a total of 3.37% usage frequency, or about every 30th word of the daily Eng. language. Frequency listing table summarily shows them under a less frequent, but more concize entry not. The Gk. and Welsh den, dim are false cognates ascending to the Türkic den- “deny”. The Sw. annu is a false cognate, it ascends to the Türkic (“no”, interjection), with a non-nasal velar version an used in speech in some Türkic languages and found in records that don't discriminate nasal and non-nasal -n- (e.g. A.-Sax. Latinized records). Türkic uses regular and superlative aŋ aŋ for emphatic negation. The Türkic is a precursor for the Eng. un-, A.-Sax. an- and un-, Gk. and Lat. ana- (e.g. anaerobic), etc., see un-. One case sprouts a typical Gmc. prosthetic anlaut g-; occasional cases of metathesis or phonetic shift. The Celtic forms attest to a presence of ne- in the N. Pontic prior to the Celtic departure of 5th-4th mill. BC on their circum-Mediterranean voyage, millenniums before the Corded Ware time. A widely distributed Far Eastern anlaut j- prompted early scholars to suggest a common origin of the pan-Eurasian and Far Eastern “no”. Notably, a consensus Swadesh List carries “no”, but excludes “yes”. Although both are originally Türkic, only one is held as “IE”. The other is a foster child. A paradigmatic transfer of the no-yes couple serves as an indelible attestation of a genetic origin from a Türkic linguistic milieu. See deny, un-, yeah.

Supplementary Note. Direction of borrowing.

An assertion of ne “no, nor” being a Pers. loanword can’t be taken seriously, for a slew of reasons. A first because the direction of the borrowing is scientifically presumptuous and groundless. Secondly because chronologically, the not numerous Indo-Aryan farmers (here, Pers.) were latecomers (ca 1500 BC) to the Near East populated by Semitic, Dravidian, and Türkic people (Guties, Kangars, Kumans among others, ca 4500 BC), and the direction of cultural borrowing would flow in the opposite direction. Thirdly because the Pers. form ne was just one of very numerous allophones with cognates spread across Eurasia: ne, ni, no, nor, not, nyet, nein, nah, nay, neither, nope, and so on. The selection of the Pers. “proto-form” appears to carry a certain biased pretense. Fourthly, because reliance on a tiny segment of linguistic evidence, based mostly on late foreign sources, can’t pretend to reflect the linguistic developments spread across 6 millennia, from ca. 5000 BC (Kurgan migrants did not migrate to the Near East without their own languages) to ca. 1000 AD. Fifthly, unlike a massive influx of the Türkic haplogroups R1a and R1b into the western Europe, a demographically massive influx of the Persian predominantly J2 haplogroup to the western Europe never happened. Each one of these reasons is sufficient to discard the Pers. loanword speculation. The unwarranted assertion on the direction of borrowing must be retrieved.

A more acute perception would connect the Türkic fundamental root (“no”, interjection) both with the negation no in all its allophonic forms, with the negation clitic -ma-, and with the negation a- and un-. A transposition and transition to nasal and non-nasal velars are of routine linguistic evolution in both internal and loanword processes, greatly amplified with the development of writing and internationalization of scripts and communities. The Türkic as a precursor for the above negation forms appears in Eng. un-, Anglo-Sax. an- and un-, Gk. and Lat. ana- (e.g. anabiosis), etc. A peculiar -ŋ- was a Türkic phoneme that could not be reproduced in many other alphabets (e.g. the Anglo-Sax. Romanized records etc.).


English same (adj., pron., n.) “similar, comparable” (Sw N/A, F276, 0.05%) ~ Türkic ӧsüm (relative indefinite pron., conj.) “like, alike, akin, similar”. “Like” is one of at least seven semantic clusters of the verbal root os-/ӧš-. Ultimately “like” is a deverbal noun/adj. derivative fr. the base verb ӧš-/os-/aša- (emph. adv.) “(degree) excessively, very” with a base notion “to grow”, formed with a suffix -(a)m/-äm to denote a degree as an abstract noun, or result or object. The metamorphous sequence is detailed below, see some. The meanings “same” and “some” ascend to the semantics of the base root, the “same” directly as a base meaning. The “some”, in contrast, formed via linguistic evolution, correctly presumed by the mistaken IE etymology as a supplanter for a preexisting native words (aen-, aene “any”, hwilc “some (one)”) and Türkic words (ilk, A.-Sax. ilca/ilcy/illca; swa “just as”, a version of ӧš-/os- q.v.), see ilk. Cognates: A.-Sax. sama, same, swa “so as, just as”, samhwylc “same (family)” (hiwna, hi(g)na “members”), ONorse same, sumr, Norw. (Bokmal) samme, (Nynorsk) same, Dan. samme, Du. samen, Sw. samma, Icl. sama, OHG. samo, Gmn. selbe, selbig, Goth. sama “same”; OIrish som, Ir. caanna, Scots samin “same, like” + “together”; Gk. ομος “same”; Bosn., Croat, Serb. isti “same” (ӧ → i, - suffix -(a)m, + suffix -ti), Bulgar sysch (ñúù), Macedonian istite (èñòèòå); Fin., Est. sama; Maltese istess “same”; Hu. azonos “same”; Ch. xiang () “mutual”. Unlike its twin sister some, the same is spread far and wide, crossing language and linguistic family borders across Eurasia from Atlantic to Pacific. By heritage, contact or osmosis, the primeval word kept spreading across millennia. Its presence in the Celtic languages attests to its existence in the N. Pontic Türkic phylum, prior to the Celtic Kurgans' departure on a circum-Mediterranean voyage from the Eastern Europe to Iberia ca. 5th-4th mill. BC. Its presence in a fraction of the Gmc. languages attests to a guest status in the hosting languages, all of which without exception have their own native synonyms. No IE connection whatsoever, the proposed “PG” and “PIE” “restored proto-words” are mechanically reverse engineered phantoms. The Türkic pronoun ӧsüm, in contrast, is an attested reality. Demographically, the flow of farmers was eastward, to the Kurgan's lands. Linguistically, the flow was westward, from the Kurgans to the migrants. The palpable supplanting of the native European lexicons with the local lexis provide a vivid picture of amalgamation and co-existence. A paradigmatic transfer of the semantic contents, derivatives, and idioms with the base lexemes (e.g. oš-oš to so-so) attests to their origins in the local Türkic milieu. See ilk, so, some.

English second (adj., n., adv.) “ordinal of two” (Sw N/A, F347, 0.04%) ~ Türkic eki, iki (n.) “two”; eki, ekinti, ekkinč “second”, ikkiz “twins”. Ultimately eki “two” is a derivative of the verb qos-, qös- “add, join (two)”, noun qos, qös “pair”, Cf. kijin “after, later (of “1”)”. An A.-Sax. word for numeral “second” was aefterlic, lit. “following (one)”. Such definition of “second” is traceable in most languages. The ONorse forms positively bridge the Türkic eki/iki with a raster of the Gmc. forms (ikki > ykkar, igqar > unhar (unchar) > unker > unk), and lead to understanding of the Lat. and OFr. versions. A single consonant k/q was probably the root of a primeval notion of movement from one thing to another, Cf. Sl. k tebe (ê òåáå) “toward (you)”. The older Türkic suffix -inti (-inch) consists of selective suffix -in and ordinal marker -ti, Cf. ekinti “second”. The last corresponds to the Eng. ordinal suffix -th, Cf. tenth, it probably originated articulation . Its other Türkic form is -inč, the corresponds to the Eng. ordinal suffix -th. In different languages -ti/-č is reflected with the forms -th, -st, -t, -sht. A.-Sax. cognates of eki were used for a complementary notion “dual”. Transition from aefterlic “following” to the second is connected with the Norman conquest and is unrelated to the form unc “two”. Romance derivatives are formed with a prosthetic/prefix s- (se-) typical for the European zone, Cf. secludo “off-claudo” (“shut off”). It probably denoted a perfect tense verbal form as compared with infinitive, morphologically se- + -k- (-kw under “IE reconstructions”), Cf. Sl. delat - sdelat (äåëàòü - ñäåëàòü) “make (of clay)- made (of clay)”, Lat. sequi, secundus, and corresponding Gmc. forms. Besides derivatives of eki “two”, Türkic has 22 more words to express the notion “join, unite”, attesting to the innate societal role of cooperation, in a stark contrast with a vile image cultivated on the western end of the Eurasia. Ultimately, the form eki was reintroduced into Eng. disguised as second. Among 44 European languages, the Sl.-Türkic drug- “other” (< Tr. dürli “other”) is used by 9 (20%) languages, Romance-Türkic “second” is used by 8 (18%) languages, forms of duo- (< Tr. dürli) is used by 6 (14%) languages, Gmc.-Türkic anden (< Tr. andai “that, other”, pointing to a distant object) is used by 5 (11%) languages, Gmc.-Türkic tweede, an allophone of duo-, q.v., (< Tr. dürli) is used by 5 (11%) languages, Fennic toinen “that” is used by 3 (7%) languages, Balto-Sl. vtor- “further” (< Tr. dürli) is used by 3 (7%) languages. The remaining 5 languages use 5 their own native words. The Türkic roots are shared by 31 (70%) of the European languages, abundantly compatible with a 50.6% R1a/b demographic presence in Europe. There is no common European or IE “second”. Cognates: A.-Sax. unc (with k-), uncer, uncet (with k-) “two (of us)”, OSax. unc (with k-), uncero (with k-) “two (of us)”, OHG unch (with h-), unchar (with h-) “two (of us)”, ONorse okkr, ykkr, okkar, ykkar “two (of us)”, Goth. igqar “(of you) two”; OFr. second, secont, Lat. sequi “follow”, secundus “following”; Hu. kettö “two”, ikker “twins”, köt- “tie”; Kuchean (aka Tokhar B, Tuhsi) ikam “two”, Agnean (aka Tokhar À, Tuhsi) wiki “20”; Mong. ikire “twins”, ekis, ikes “placenta”, Kalm. ikr “twins”, Evenk igire “twins”; Kor. pegim “next, second, following”, Cf. čeim “first”. Distribution: truly Eurasian, from sea to shining sea. IE etymology is non-existent, it appeals to freshly cooked “PIE proto-root” *sekw- “follow” and “PIE participal proto-compound” *sekw-ondo-. Of those are credible only the attested local suffix ondo- and the notion “follow, further, that, other”. The form ikkiz, like its European calque twins, contains an archaic Türkic dual plural marker -s/-z, applicable to objects that come in pairs, like eyes (göz), horns (buynuz), twins (ikkiz). In English, the archaic Türkic dual plural suffix became a generic plural suffix -s, see -s. The dual plural suffix -s, besides Türkic, is used in Greek, it is absent in the European languages. A contrast between Gmc. and Romance forms attests that the Gmc. forms lived in an agglutinative Sprachbund, where words started with roots uncontaminated by prefixes. They carried roots well into the turn of the eras period of Romanized Indo-Europeization, while the Lat. and OFr. forms evolved and were internalized in a polysynthetic Sprachbund with vigorous preference to prefixation. The Balto-Sl. vtor- and Gk. hateros (ατερος) “following, different” attest to a presence of the word in the Corded Ware period. It is split-present both in the NW Europe and in the Hindustan peninsula, carried there in the 2nd mill. BC by the Indo-Aryan migrant farmers, Cf. Skt. vitaram, Av. vitara- “further” fr. Skt. vi “apart”. The Celtic forms Ir. dara, Scots darna (~ tor-) may even carry the dating to the 6th-5th mill. BC Eastern Europe before the Celtic departure on their circum-Mediterranean anabasis. Archeological evidence, genetic dating, and lexical evidence of absence of some but presence of other words in the Asian Indo-European languages signal nuanced linguistic transformations. They allow tracing of demographic flows and timing of demographic splits. A paradigmatic transfer of the complex first, second, and ilk, and of the integral morphological elements, emphatically attest to their origin from a Türkic lexicon. See first, Numerals – Preliminary Note, -s.

English sex (n.) “copulation, gender” (Sw N/A, F640, 0.02%) ~ Türkic sik (n.) “copulation, fuck” (of male only, with a female), sikiš (n.) “sex, copulation” (mutual). The term flagrantly carries a standard European etymological stipulation “of uncertain origin”. Ultimately fr. the noun root sik “penis” and its prime functions, a derivative of the verb sik-/siy- “urinate”, see sit. The verb sik- is a variation on the theme “water, liquid” expressed with a most common noun syllable su and its allophones su:/so/si/suv and the like. The prosthetic -v- with allophones -w-/-d-/-δ-/-g-/-γ-/-k- apparently emerged to link with the vocalic suffixes, and later fossilized as a part of the root, forming suv/su:v/suw/sug “water, liquid” and the like. The -g-/-γ-/-k- may have been denoting a result of action, in the case of sik “urinate, urine”, then “penis”, and on to “mating, copulation”, “gender”, and “sex”. Every European language has a diverse slew of synonyms for “copulate” unique for every language with a majority in incompatible terms obviously of particular native origins. The A.-Sax. came with numerous synonyms, some of them were iape/yape, sard, fucke (~ Ar. facara), swive. Cognates: A.-Sax. sard, OE (15th c.) fukkit, Gmc. fokken, fanden, faen, ficken, fiksje, and the like, “fuck, copulate”; Lat. sexus “gender”; Mong. sige- “urinate”; Manchu sike “urine”, site- “urinate”. The form sard echoes the Türkic forms sid and sidik “urine” (Alat/Haladj, Tabgach/Tofalar, Saryg Yugur, Khakass). The Khakass connection is most prominent among the Eng. Turkisms. A connection between sik and fuck is tenuous, predicated on s/h/f alternation. The s/h alternation is well established for the Aral-Caspian basin. Starting from 1st mill. BC, the southeastern Europe Kurganians repopulated Aral-Caspian basin, the s/h alternation there can be only dated to the post-Corded Ware period. The transition h/f is dubious, even with all Gmc. weakness for the initial f-. The Far Eastern connection is reliable, it retained the earlier semantics of “urinate”. A synonymous A.-Sax. iape/yape, with the cognates Skt. yabhati, Sl. ebat (åáàòü) “copulate”, a derivative of the Türkic noun eb/em/am “vulva, female” and a cognate of ewe, Eve, provides another attestation of the Türkic – A.-Sax. connection. The non-IE origin of the “sex” is obvious due to its singular European presence. It flagrantly carries the standard European etymological stipulation “of uncertain origin”. The IE efforts to fabricate a local connection for sex are pathetic, neither “division”, nor “to sever”, nor “cut up” fit the bill to any sane degree. Whether sex was inherited or passed via Lat., the sexual septet of sex, dick, cock, pussy, quim, ewe, and cynd/cunt provides ample evidence on the paradigmatic transfer from the Türkic milieu. See ewe, cock, dick, kin, pussy, quim, sit.

English some (adv., adj.) “unspecified, unknown” (Sw N/A, F93, 0.76%) ~ Türkic ӧsüm, ӧsǝm (relative indefinite pron., conj.) “(something) few, little, petty, unspecified, unknown”. Ultimately a deverbal noun derivative fr. the verb ӧš-/os-/aša- (emph. adv.) “(degree) excessively, very” fr. a base notion “to grow”, formed with a suffix -(a)m/-äm to denote a degree as an abstract noun, or result or object, Cf. gleam, glitter, Helios fr. Türkic yal- “shine”, seam, sew fr. Türkic sok-, suq- “sew”, also Cf. ošadïɣï, ošatïɣï “similar, corresponding” related to “same”, see same. A semantic derivative of ӧš-/os- is a pron. ӧša- “that”, “that one (mentioned)”, “as, like, similar, resembling, compatible”, “this way”, “so, thus, such”. The ӧš-/os- with suffixed demonstrative pronoun -o formed a type of ӧša/osa with alternating -a/-o/-ï which in turn led to the loss of the initial vowel, ӧša/osa > šo-/so-, and eventually to a rendering swa. That metamorphose had been described still in 1920 (Deny J., Grammaire de la langue turque (dialecte osmanli), §311, §1041. Paris, 1920). In applications, the emphatic ӧš-/oš- fused with demonstrative pronouns bu and ol: ošbu (is) “this”, ošol (passive -l) “that”. The oš- and ošbu are protagonists for the A.-Sax. swa “so” with š and b pronounced s and w, peculiar to the E. European and Gmc. articulations. The basic form ӧš-/os- grew into the A.-Sax. swa with elided indigestible ӧ-/o and a rich throve of meanings carried on from their Türkic ancestors. The swa branched to express Eng. “same” and “some”. The prosthetic g- (gleam) and h- (Helios) point to Oguzic articulation with an initial consonant/semi-consonant. Prosthetic initial h- is peculiar to Aral-Caspian area, now carried by Bashkir vernaculars. A Türkic adverb a:z/az “few, scanty, little, rare” is semantically and phonetically a reflex of ӧš-/os-, and may have a shared ancestry. It is held as an independent lexeme. Treated as two forms of an allophonic verb a:z/a:r- with s/r split, its basic notion is “macerate, slimmer”, which produces derivational “few”, i.e. it is synonymous with “some”. The form ar of a:z/a:r- also produced an Eng. conj. “or”, see or. Either etymology is sufficiently substantiated via attested ӧsüm or attested a:z as azam, semantically equivalent to “some” without appealing to any faux “proto-roots”. There is a third member, a personal pron. ӧz “self, own” that parallels ӧš- and a:z. Its function as a personal pronoun “self” echoes the demonstrative pronouns of the ӧš- “that”, “that one”, “same” and “some”. The lines ӧš- and az were found to be the most ancient. Only a their pale trace remains in Europe. Among 44 European languages, a Türkic root ne-, neke- predominates with 16 (36%) languages, followed by Romance algu- “some” 7 (16%) and Türkic ka-, ka:- “some, how, which? what?” 4 (9%), and then by Türkic ӧš- “some” 3 (7%) and Balt. da- 3 (7%) languages. The remaining 12 languages use 10 their own different native words. Türkic roots are shared by 23 (52%) of the European languages, matching a 50.6% R1a/b demographic presence in Europe. Cognates: A.-Sax. sam, sum “some”, OSax., OFris., OHG sum, ONorse sumr, Goth. sums, Du. sommige, Sw. somliga, Icl. sumir. Distribution is confined to a fraction of the Gmc. languages, attesting to a guest status in the hosting languages, all of which without exception have their own native synonyms. Even within Gmc. languages that have it, it may be dialectal, or survived only in derivatives or compounds. No IE connection whatsoever, the proposed “PG” and “PIE” “restored proto-words” are phantoms of wild doctrinal imagination. The Türkic pronoun ӧsüm, in contrast, is an attested reality. The relation with other meanings of the same root in the host and Türkic languages demonstrates a nebular semantic connection, a stochastic product confined to few vernaculars of the linguistic phylum. These properties of exclusivity provide an exceptional ethnological diagnostic value independent of the biological genetic tracings. Even within the framework of the relatively late Corded Ware period, it was a late acquisition, probably no earlier than the second mill. BC. At the same time, the phonetic and functional parallelism within the cluster os/ӧz/az/ar is striking, it suggests a primeval common origin. That would point to the word's belonging to a very early period of linguistic evolution on the Türkic soil, and help explain the indiscriminative nature or universalism of the A.-Sax. internalization. The swa also was a latecomer to A.-Sax./English, it took eons for it to supplant the native hwilc ubiquitous for the A.-Sax. See as, same, so.

English terra, terrain (n.) “grounds, ground for training horses” ~ Türkic tera (tera yeri), teriŋ, tirän (n.) “valley, lowland”, ter- (v.) “pasture”. A prime notion is ultimately connected with mountainous terrain and its valleys. A second, weakly connected notion of ter (n., adj.) is connected with water: “depth, deep”, “wide, width, open, hidden”. No listed Far Eastern connections. A Türkic ter “pasture land” is probably a derivative of er (yer, yeri) “earth”, a contraction of a phrase öt yer “grass-earth” with initial unaccented o- elided, fossilized with and without a possessive or obsolete intensive suffix -an/-aŋ/-än. The verb ter- must ascend to pre-domestication times, when hunter-gatherers' prime target were pasturing animals. It spread far and wide, becoming a focus of daily life in a pastoral economy and a most productive stem. Its allophones are based on interchangeability of back and middle vowels a-e-o-u > tar-ter-tor-tur. Semantic fields derived from “valley” > “pasture” developed into “land” (terra, territory, terrain), “dry land”, “flat land”, “land tract”, “valley”, “soil-tilling”, “hard labor”, “stop-over”, “stay over”, “dwelling” (Eng. tower), “tarry”, “earthwork”, then to “estate”, “village”, and so on. Many derivatives have reflexes in Indo-European languages. Taken as a group, these “Indo-European” derivatives do not find a common etiological stem, they wander like tipsy sailors on a way to a ship. Nowadays an international word found across the globe, Cf. Jap. tera, terein (テラ、テレイン), Hu. terra, terep. Cognates: A.-Sax. ðorp, ðrop “farm”, ðerscan “thresh”, OFris thorp, Fris. terp “farm”, ONorse ðorp “farm”, MDu, Du. dorp “farm”, OHG, Gmn. dorf “village”, Goth. þaurp “field”; OCS trezati (òðѣçàòè) “pluck”, Slov. trzati “graze, pluck”; Eng. place names ending in -thorp, -thrup. Distribution: Eurasian Steppe Belt plus 3 out of 10 Indo-European branches. An assertion of IE etymology of an origin fr. a faux “PIE proto-word” *ters- “dry” is way beyond incredible, on a level of terry or terse. Compared with semantics of “valley” or “lowland” pastures, an appellation “dry” is delirious, unless used by migrating birds, Archosaurians, or Pisces. The word had reached three European branches by three independent allochronic paths, reflected in different developmental scenes. Some derivatives came directly from the source language, usually rated as “of uncertain origin”. Others came via Lat. or French, and thus dead-end at Lat. Some more are supplied with unattested “IE reconstructions” of dubious relevance. The English cluster includes numerous words with the tVr stem: tarry, terra, terrace, terracotta (earthenware), terrace. terrain, terrene, terrestrial, territory, tower, turf (“surface of grassland”, aka Turfan “pastureland”). The Ir. and Scots Celts left Eastern Europe before domestication of Eutherians, invention of pastures, and of the term ter. The Welsh, apparently, migrated after such domestication. The word terrain carried over a non-adjectival suffix -an (Cf. adjectival Indian, riparian) of its ancestors teriŋ, tirän and their ilk. The complex of phonetic, semantic, and morphological coherence vividly attests to an origin from a Türkic milieu of a certain epoch. See Earth, turf.

English time (n.), timely (adv.) (Sw N/A, F75, 0.27%) ~ Türkic timin, demin (adv.) “time, moment, now, outright”. Ultimately an instrumental case of the adv. tem/tim “just, recently” with few more meanings, including homophonic “help” with its derivatives. The word belongs to an oldest lexicon (EDTL 187). Türkic has numerous words to refer to an abstract continuous time, but except qolu none of them have an element of time measurement: öd, öδ period, moment (see period); öδla choose, appoint moment, period; öδläk time period, time (generic); čer season, period; čerig, čerlik suitable moment; oɣur period < timely; qačan, qaju when; qolu ten second period; rüzgar epoch; tïδïn moment in time; turum àrà during (period, month, hour); tuš period, all time; vaqt defined period; zamana epoch, fate. It appears that a notion of certain time expressed by Türkic timin developed into an adverb timely, and then expanded to the notions of time duration (hours) and moment (10 o'clock) time (n.), and of a time flow time (n.). Cognates: A.-Sax. tima “limited space of time”, Dan. time “hour”, ONorse timi “time, proper time”, Norw. time “time, hour”, Sw. timme “an hour”, Faroese tími “hour, time”, Icl. timi “time, season”, Alemannic Gmn. zimen, zimmän “time, time of the year, opportune time”; Scots tym, tyme “time”; Fr. temps, Lat. tempus “time”; Pers. dǝm “moment” (< Tr. dǝm ditto); Mong. dem “timely (help)”. Distribution: staple in the Steppe Belt spilling to adjacent fringes. The IE family does not have a common stem for “time”. That makes all allophones of the form time guests from a non-IE family, with peculiar distribution among few Gmc. and Romance groups. There is no need for “reconstructed” ersatzes. No concocted “PG proto-form” *timon- or *timo or a faux “PIE proto-form” *dehimo or *dimon- or *dehy- “divide” is needed, the claimed fictions are eidetic with an attested Türkic timin. The incoherent IE etymology suggests unattested IE root *da- “cut up, divide”, an origin from a “tide”, a faux “PIE” *tempos “stretch” > “time” or a faux “PIE” *temh- “to cut”. The IE-locked queries are ludicrous, a wild goose chaise in search for a problem. Türkic is a good example on an antiquity of the notion of time, it had developed 11 discrete references to time, and probably some others did not enter a dictionary. A.-Sax. host had at least 3 phonetic forms (tima, tiema, tyma) plus a contracted form -tid in compounds, and at least 8 derivative forms. The semantics and phonetics of timely, time, and timin suggest down-to-earth etymology without any long-haul fishing expeditions and phantom conjectures. The time has no relation to cutting or tides leading to a word that existed in any human society from the days immemorial. A case of paradigmatic transfer of two time-related words, the timin “time” and öd “period”, attests inescapably to a genetic origin from a Türkic lexicon. See period.

English woman (n.) “female” (Sw36, F263, 0.07%) ~ Türkic ebe, eve, eme, ebi, ama, auml;mä (n.) “wife, mama”, ebi-, ebe-, eve- (v.) “engender, birth-giving”. Standing European etymological rating as “of uncertain origin”. A noun is recorded in a wide range of fairly close forms. It is ultimately derived from or has produced a stem eb/em “female genitalia” and its derivative emig “nipple, teat”, see tit. A component wo-, wi- in the compound “woman” (A.-Sax. pl. wi-: wimmen) relays a notion “female” and its siblings, see wife. Since woman is a neologistic compound, except for the part wo- it does not belong to a category of “basic words”. It endured a chain of correlated transformations, ebi- > wefi- > wif, where ebi- is one of allophonic forms, wefi- has added a prosthetic anlaut consonant, and wif is a fossilized A.-Sax. form for a “woman, female, wife, lady”. The part men > man > -man “adult human” ascends to a notion “I, me”, see man. A Gmc. genderless man “person” fossilized with A.-Sax. extended semantics of “person (male or fem.), mankind, brave man, hero, vassal, servant”. The word ebe belongs to an earliest period of the humanity, preceding an appearance of family-based society/relations. It is shared by genetically distinct lines, e.g. Mongolian (Y-DNA Hg. C ca. 40kya), European (I ca. 48kya, H ca. 48kya), Finno-Ugrian (N ca. 44kya), Manchu-Tungus (O1/2 ca. 40kya), Altaian (R1a/b ca. 24kya), i.e. migrations ascending to the Out-of-Africa times. Semantically, the term evolved from an earliest notion aba “senior, dominant” to gender undifferentiated aba “senior relatives” to gender undifferentiated aba “grandparents, father, mother” to gender differentiated aba, ebe “father, mother, uncle, aunt, etc.” The notions reflect the social structures of the times. Only spotty attestations of the earliest semantics have survived to modernity in some linguistic islets; intermediate traces are well attested; and the later appellations are well documented. The term reached literate times in two gender-differentiated lines, a male aba and a female ebe, each in a variety of allophones, 31+ and 15+ respectively. In the 2nd-1st mill. BC the aba “senior, father, etc.” was supplanted by a newcomer ata “father” which gained wide circulation across Eurasia (Cf. Lat. atta, Hunnic Attila, Turkish Ataturk) without impacting ebi “senior female”. There also were transitions from matriarchate to patriarchate (ca. 5th c. AD ±2 cc.), which impacted correlation of the terms. Continued amalgamations of newly confederated ethnicities impacted areal terminology. A.-Sax. had a rich native vocabulary for “woman”: -estre “fem. suffix”, heo, bitch, cilfor, cwene/cwine/cwyne, maeg, gemaeg, -oge (< ebe?), ðir, quean < Goth kwino “prostitute”, spinelhealf “female line (of descent)”, plus the “travelling term” ebi-/wif. Of 44 European languages, 27 (61%) use allophones of the Türkic ebe/eme and Eng. ewe, matching a level of the 50.6% R1a/b demographic presence in Europe. The remaining 17 (39%) languages use their own 15 different native words. There is no common “pan-IE” root. Cognates: A.-Sax. eowu “female sheep”, eow “sheep”, Eng. ewe “female sheep”, Eve “pra-mother” (< Tr. “mother, engender”; Biblical “pra-mother”), WFris. eiewe, female sheep”, Du. ooi “ewe”, vrouwmens “wife”, a neologistic compound lit. “free-woman person”, Gmn. aue “ewe, female sheep”; OIr. oi “ewe”; Lat. ovis “ewe”; Balto-Sl. (Lith.) avìs “ewe”, (Rus.) ovtsa (îâöà) “ewe”; Fin. åmä “mother (of animal)”, Est. ema, Hu. eme “ewe, bitch (animal)”; Kuchean (aka Tokhar B, Tuhsi) auw “ewe”; Nenets (Urak) newe, niemea “mother”, (Tavgi, Nganasan) name “mother”, (Selkup) àmó, åu “mother”, ima, åmà “woman, old woman”, (Koibal) nemyka “old woman”; Lapp. (Saami) ibme, ime “uncle's wife (mother's, father's side)”; Ostiak (Khanty) imi “husband's mother, husband's older sister, mother's older sister, wife of father's older brother”; Mong. eme, ǝm “woman, wife”, emege, ǝmeg “old woman, granny”; Kor. emeži, eomma (엄마) “mommy”. Terminological parallels across linguistic borders and geographies were numerously observed, noted, and attempted to explain. Although semantics may be area specific, the spread spans across the Eurasian width. For a phenomenon “of uncertain origin” any claim of a “reconstructed proto-word” is a quasi-scholastic absurd. There was no need to invent faux “PG” *awiz “ewe” nor a “PIE” *howis “ewe”. By the PG time, the word already belonged to a very hoary antiquity, its origins are fairly clear, and at which period. The W. Eurasian turbulence of the 3rd-2nd mill. BC, the two-way migrations connected with the period of the Central European “killing fields”, the demic displacements, dispersions and amalgamations drove intensive cultural exchange known only in a fuzzy outline. A mass of social and technological traits was disseminated extending from Iberia in the west to the Far East in the east. A Türkic-Scythian-Kurgan tradition related to the notion woman was a single combat between prospective bride and groom, with a bride as a jeŋ- “win” prize (i.e. a “trophy”), as opposed to a statutory (ebe- “wife”) and a favorite (sevig “beloved”). The A.-Sax. cwen, Eng. quinn, Goth qens, qeins, qineins, qino “wife, woman” ascend to that ancient tradition, eons before an invention of the family names. In a chain of innovations, the A.-Sax. wifmann “woman, female servant” (m., 8th c.) metamorphosed to wimman, wiman “woman-man” (fem.), i.e. exclusively a “female (adult)”. Notably, A.-Sax. used 10 ways to express “woman, wife”, of which 5 were Türkic cognates (bryd “bride, wife, consort” < be:r- “bear/carry, give”; bert “given”; awe “married woman” < eve- “engender”, Cf. ewe; cwen “woman, wife, consort, queen” ~ yeŋä/jenä “wife”; gemaeg “wife, woman” i.e. “of major, close (relative)” ~ bögü:/mögü: (m/b alternation) “might” , wif “wife, woman” ~ eve- “engender”); two belonged to the European vernaculars (famne “woman, maid, virgin, bride” Cf. Lat. femina “female”, ides “woman, wife”), and at least 3 were metaphorical extensions (freo, frowe “free”, haemedwif “married woman”, meowle Goth. mawilo, Cf. maw “mouth”). A complete complex, starting with the primeval aba and in a wealth of social reflexes has survived only in a body of the Türkic languages, attesting to a dissemination via a Türkic philological node. See Eve, ewe, man, tit, wife.

English yeah (ay, aye, huh, uh, uh-huh, yah, yea, yeah, yes, yep, yup) (interj.) “affirmative, affirmative response” (Sw N/A, F50, 0.70%) ~ Türkic yah, ye, yeh, de (interj.) “affirmative, affirmative response”. There is no common IE “yes”, no faux “PIE” ersatz. In Europe, Gmc. and Sl. are the groups that follow a Türkic trail. Of 44 European languages, Gmc. (ja etc.) leads with 18 (41%), and Sl. (da etc.) with 8 (18%) languages, for a combined total of 26 (59%), matching a level of the 50.6% R1a/b demographic presence in Europe. The other 18 languages march to their own tunes with 9 distinct native words. Like Türkic, English has numerous allophones and various spellings. In a usage frequency rating, Eng. allophones occupy a very prominent place (Table 1a): yeah (F47, 0.46%), yes (F89, 0.21%), uh (F130, 0.14%), huh (F199, 0.08%), yep (F1223, 0.01%), for a total of 0.89% usage frequency, or about every 100th word in a daily language. Table 1a summarily show them under a less frequent, but more formal entry yes. Cognates: A.-Sax. ge, gea, gese, gise, gyse, ise “yes” (g- ~ ž-) , Eng., Dan., Norse, Sw., Gmn., Slov., Welsh, Yidd., Latv., Fin., Est., etc. ya “yes”, Serb., Croat, Russ., Ukr. da “yes” (< Tr. de); Pers. ye “or”; Gujarati ya () “yes”; Ar. ya: “oh!”; Kalm. dzä, dzǝ “OK, good”. The listing is sorely incomplete since IE etymology does not poke its head beyond the IE dome, albeit studies do note strong parallelism of the “IE” and non-IE vocabularies. Other European forms: Fin. kyllä, Hu. igen, Basque bai; Lat. imo, Fr. oui, Sp., It. si, Port. sim; Slovak ano, Lith. taip, Gk. ινα, και, εaν, αλλa, δε (ina, kai, ean, alla, de), etc. Distribution is Eurasian-wide across linguistic borders. IE etymology offers a preposterous origin (for the Eng. “yes” only, not for the “IE” “yes”) from a compound of “so” + “to be” > gea, ge + si > yea, a desperately bizarre and unrealistic origin. With a complex of attested Türkic forms there is no need to invent new “PG proto-words”. For the Sl. allophone da and the Gk. allophone δε (de) it offers an auslaut part of random Gr. words ending in -δη (-dι). There is a handful of other, not any less bizarre suggestions beyond a point of absurdity. Notably, the consensus Swadesh List carries the “no”, but excludes the “yes”. The A.-Sax. forms, Sl. cognate da, and its Gk. version δε (de) with the anlaut consonant betrays a Türkic Oguric origin. Ditto the forms with an anlaut semi-consonant y-, j-, dz- (Cf. Hunnic Ogur titles in Chinese 2nd c. BC rendition Chjuki-Prince ~ Türkic jükü (Hunnic Ogur)/ükü (Oguz) “wise”; Ükü Bek; Luli-Prince ~ Türkic luli (Hunnic Ogur)/ulu/ulug/uluγ (Oguz) “great”, i.e. Ulu(g) Bek, etc.). The forms with a vowel anlaut either elided anlaut consonant or supplanted it with an Oguzic articulation. The attested allophone ya is a form of the attested Türkic yah “yes” (OTD p. 252). Although both are originally Türkic, only one is rated “IE”. The other is a neglected foster child. Go guess. For the English - Türkic pair yeah - yah, the semantic and phonetic equivalence are absolute, for the other allophones the same common Türkic origin is perfectly clear. See no, un-.

1.1 Personal and demonstrative pronouns (I, me, my, she, us, we, you, dual plurals, and demonstratives)
Interrogative, Reflexive, Reciprocal, Indefinite, and Relative pronouns are outside of this section

Table 5. Türkic–English pronominal correspondences
the Germanic-Türkic cognates are highlighted in bold;
the Türkic ikki “two, dual” is seen in Goth., ONorse dual okkar/ugkara and the field of their allophones
the Türkic Oguz b is rendered v and w in Germanic languages;
the Türkic Oguz z, s is rendered þ, ð in Germanic languages;
the Türkic Oguz conjugations are for illustration only, to show nuances lost in conversion
the Türkic sen/sin apparently used to be polysemantic, applicable to the 2nd and 3rd person.  
lst Person
Conjugation English OE Türkic Gothic A.-Sax. OSax. OFris. OHG ONorse Notes
Nom. I ik (ic) ič (ich), es ik ic (ik) ic it ik (ihha) ek
Gen. me min miniŋ, minüŋ meina min min min min min
Dat. me me miŋä mis me mi mi mir mer
Accus. me mek, me mini, minig mik mec, me mic, mi mi mih mik
Accu. Init. minigdin, miniŋtin
Accu. Loc. miniŋdä
Loc. mintä
Loc. Init. mindin, mindän, miniŋdin, mintin
Instr. minin, minniŋ
Poss. minin
Neg. minigsiz, minsiz,
Nom. wit biz (ikki) vit wit wit *wiz vit
Gen. uncer biziŋ (ikkiiŋ) ugkara uncer uncero unchar okkar
Dat. unc bizkä (ikkikä) ugkis unc unc *unch okkr
Accus. unc bizni (ikkini) ugkis uncit, unc unc *unch okkr
Nom. we git biz veis (weis) we wi, we wi wir ver
Gen. us incer biziŋ ös/öz unsara user (ure) user user uncar Var (vor)
Dat. us inc bizkä ös/öz unsis, uns us us us uns oss
Accus. us inc bizni ös/öz unsis, uns Usic, us us us unsih oss
2nd Person
Nom. thou þu (thu) sen þu (thu) þu (thu) thu thu du þu (thu )
Gen. thou þin (thin) seniŋ, seniŋdä, seniŋdin Þeina þin thin thin din þin
Dat. thou þe (the) saŋa, saŋar, seŋär þis þe thi thi dir þer
Accus. thou þek (thek), þe (the) seni, sini þik Þec, Þe thic, thi thi dih þik
Nom. ikki ös/öz *jut git, gyt git *jiz, iz it, þit
Gen. ikki ös/öz igqara incer *incero *inchar ykkar
Dat. ikki ös/öz igqis inc inc *inch ykkr
Accus. ikkini ös/öz igqis incit, inc inc *inch ykkr
Nom. ye   biz senlar jus ge gi, ge i, gi ier, ir er, þer
Gen. ye   biziŋ seniŋ + pl. aff. izvara eower iwar iuwer iwar yðar
Dat. eow   bizkä saŋa, saŋar, seŋär + pl. aff. izvis eow iu iu, io iu yðr
Accus. eow   bizni seni, sini + pl. aff. izvis eowic, eow iu iu, io iwih yðr
3rd Person
Nom. he, she   ol is, sa he m. heo f. hit n.
Gen. him, her   anuŋ seina sin sin sin sen/sin?
Dat. him, her   aŋà sis him (sig, sih, sic) ser sen/sin?
Accus. him, her   anï, anïŋ sik (sig, sih, sic) sih sik sen/sin?
DUAL -None-
PLURAL of he, heo, hit /// pron. 3 pers.
Nom. they   olar he m. heo f. hit n.
Gen. them   anuŋlar scina sin sen/sin?
Dat. them   aŋàlar sis him, heom, eom, inc, (sig, sih, sic) ser sen/sin?
Accus. them   olarnï , olarda sik (sig, sih, sic) sih sik sen/sin?

Immediate impression shows a time and geography dependent vanishing cline from basic to complicated. The most basic Türkic pronominals, learned in a childhood or early personal contacts, are internalized with minor changes. The rarer complicated forms are internalized with a loss of grammar and accuracy. The rarest most complicated forms are not mastered at all. Internalization is reduced with geographic distance. Internalization among the oldest ancestors, Goths, ONorse, and A.-Sax., had retained grammatical elements of the the dual form. In Türkic it originally was reserved only for the paired objects, like eyes and river banks; in English it grew to a plural -s. Those folks also became famous for the most consequential adventures of the Goths, Alans, Vikings, and Northmen, the aftermath pursues us into the present, including a devastating impact on the Eastern European Khazaria, and the discovery of Iceland, Greenland, and America.


English he (pronoun) “male personal” (Sw3, F37, 1.41%) ~ Türkic hu, šu (shu) (pronoun) “this/that”. The Eng. he is a result of sequential diversification and specialization of the universal undifferentiated basic polysemantic šu to the allophonic se, he, and on to he and she. The attested h- allophones heo, hio of the seo are the Aral-Caspian area reflexes ho/hu of the common Türkic šo/so/šu. By the 13th c. the form se “he” faded, supplanted by he, while the forms seo, sio “she” stabilized as she “she”. The transition from the A.-Sax. form se to he attests to the significant demographic presence of the Aral-Caspian element in the pre-13th c. society, the genderless forms se and he continued to coexist till they were forced into the dominant gender-sensitive grammar, which induced speciation. Likely, in a male-dominates society, he tended to become gendered, while se remained genderless, akin to “it”, and applied to chattel, females, and other property. There was no “shift” of he from the female to male, in both instances the shift was from genderless to gendered. The Türkic šu is genderless, it comes in flavors šo/so and šu, with a Sprachbund-originated allophonic articulation ho and hu. Türkic has at least 6 basic ways to express the demonstrative, as opposed to the conjunctional, “that”: o/ol, so/šo/šu/sol/šol/šul, leš, te/ti/tu/tet/tes, an/ïn/un, and gol; of that lineup, the šu is but one particular form. Romance languages had inherited leš, an ethnically diagnostic trait. The A.-Sax. forms of the šu are masc. se, and fem. seo, the se/seo incorporated into A.-Sax. with the entire semantic bouquet of its Türkic sibling: personal, demonstrative, and relative pronoun, see so, this, that. The A.-Sax. neut. personal pronoun ðat, ðaet “it, that” is a part of the A.-Sax. triplet se, seo, ðaet. It originated fr. the Türkic form Romanized as tet, etymologically unrelated to the form šu, it is a derivative of the verb te- “tell, state” used for the neut. pronoun “it”, as a conjunction “that, so that, in order that, after that, then, thence”, and it developed into the article the (Sw N/A, F7, 2.92%). Cognates: A.-Sax. he, heom, hea, heora, hi, hie, hyra, se “he” (nominal, dative, accusative, etc., forms), the nom. he and se are allophones, OSax., OFris., MDu. he, hi, Du. hy, OHG he. The ethnic diagnostic bifurcation points to an amalgamation of the vernaculars closer to the CT (so//šo/šu) with the vernaculars closer to the Aral-Caspian area (he, hea, hi, hie). Under the IE paradigm, ðaet etymologically is erroneously confused with the origin of the unrelated personal pronouns se and seo. The IE etymology offers a PG “proto-word” *hi-- and a PIE “proto-word” *ki-/*ko- “this, here” supported by the attested Hitt. (16th c. BC) ki “this”, Gk. sou (σου) “you”, ekeinos “that person”, Balto-Sl. (Lith.) šis, (OCS) si “this”. The attested Hitt. ki and Gk. -kei- happened to be allophones of the attested Türkic (Chuv.) ko “this” and the OT nominal suffix -oq, with numerous cognates in the Far Eastern languages (Dybo A.V. 2013, p.493). The Balto-Sl. ši/si and the Gk. sou happened to be allophones of the attested Türkic form “this”, an allophone of the same Türkic šu (q.v.). Neither Hittites nor Greeks nor Balto-Slavics are suspected to have originated in the Far East. There is no need to invent PG and PIE “proto-words”. The total overlap of the PG/PIE conjured “proto-words” with the attested Türkic lexicon carry a ghostlike character, like a shade glued to the Türkic soles. The carryover of the personal pronouns I, we, he, she is an authentic case of paradigmatic transfer attesting to a common genetic connection in demographic and linguistic aspects. The two derivatives of šu, he (0.57%), and she (0.42%), and 3 derivatives of te-/tet/tes the (2.92%), that (1.57%), and this (0.95%)” are a most popular group in English, contributing combined 6.43% frequency usage, bringing the total frequency usage of Turkisms in English to reach over 40%. See she, so, this, that.

English I (pronoun) “1st pers. sing.” (Sw1, F2, 5.50%) ~ Türkic ič, es (pronoun) “I, 1st pers. sing.” (OTD p. 201, “3. auxiliary postposition I”), self, a semantic extension of the “inner”. The pronoun is declined as any other noun: ičinte: beg “I (was) a bek”, balık ičiŋe “I (entered) town” . The Gmc. Ich, Ih “I”, English I “I” and the attested in Türkic runic script Khazar (ik) Ik “I” are allophonic forms of the form “inner”. The same ligature depicts a particle ok/ök, a preceding word defines the vowel. In Türkic syntax, attached to personal and demonstrative pronouns, the ok/ök is synonymous with the pronoun, e.g. (ben) ök “I, me”. Used in an alien syntax as a separate word, it directly interprets and forms a notion “I” as a most frequent word. Without vowel indicated, it is up to a reader to articulate it as i-, o-, or ö-. In practice, the three articulations conflate to a single accepted Sprachbund version. The word had a pronounced areal distribution, never used in the south-east, and long waned in the south-west, leaving it to circulate exclusively in the north-west (EDT, p. 76). As an emphatic particle particle, it forms the word OK (o'kei), see OK. The form is Common Türkic (Oguz), the phonetic renditions of the Khazar “I” (likely Ogur) waver between -k and -x (kh): Ik/Ix/Ikh/Ih/Ich. The modern English form (12th c.) is a contraction of the A.-Sax. ic, 1st pers. sing. nominative pronoun. There is a semantic grey area between “I” and “me” that points to their interchangeability. The English form is ultimately derived from the Türkic allophonic form  via Gmc. ih without voicing and aspiration. Cognates: A.-Sax. ik (spelled ic), OFris. ik, ONorse ek, Norw. eg, Dan., Norw. jeg, OHG ih, Gmn. ich, Goth. ik, Icl. eg; Ir. agam, me ag “I mine, me I”, Welsh wyf; Balt.-Sl. (Lith.) , (Latv.) es, (Sl.) ja; Lat. ego, Fr. Je; Gk. ego (εγω), ekho (εχω) “I, me”; Fin. itse “me, self” (initially “soul”; Tungus eče-sin “turn inside”; Skt. ah(am); Hitt. uk “I, me”. Except for a more remote Welsh form, all cognates are immediate allophones of the Türkic (Gmc., Celtic, Gk., Skt., Hitt.) or es (Lith., Latv, Sl., Fin., Tungus). The Türkic word me (min, men, bin, ben) is an objective form of “I”, it appears to have been in circulation before the appearance of the word as “I”. The Irish compound form is peculiar, it literally combines the semantics of “inner” with the semantics of “me”. The dictionary should also list them separately. The Irish agglutinative agam is a compound of ag “I” homophonous with “inner” and possessive suffix -m standing for “mine”, forming a notion “I” from the compound “mine inner” distinct from the “inner”. The Ir. analytic compound me ag serves the same function, it combines the objective form me of “I” with the discriminant “inner”, creating the notion of “I” from “me inner”. The Irish has preserved the frozen archaic forms in agglutinative and analytic syntax versions from the time when, without a definition “me”, “mine”, the still denoted “inner”. They attest to the existence of the components me and ag in the 5th mill BC, prior to the Celtic circum-Mediterranean migration. They allow a peek into the Kurgan culture's prehistoric vernaculars in the N. Pontic area spoken by the R1b-marked people. Most of the European forms ascend to the form , but the archaic Lith. and Latv. forms differ, they point to the origin fr. the Türkic phonetic forms of es. The Skt. form is predictable, it reflects the typical s/h alternation in the Aral-Caspian basin. It could ascend to either or es, but it can't reflect a later Iranic phonology since that is a parallel prong of the 2nd mill. BC Indo-Aryan migration. In agglutinative languages like Türkic and Sanskrit, the 1st, 2nd, or 3rd pers. is indicated by modifying roots with corresponding suffixes, and the use of the 1st pers. sing. pronoun is minimal. In Türkic, the objective form of the personal pronoun is morphologically an individual lexeme and a suffix marker, used individually or in combination depending on the syntax of the sentence. With a switch to the syntax of the flexive languages arises a need to separate the agglutinated pronoun suffixes into individual lexemes. The Hitt., Gk., and CT forms are nearly identical. The IE etymology ascends the IE forms for “I” to an unattested PIE *eg-, without addressing its source and etymology, and disregarding the crucial Türkic and Türkic connection. The distribution of the word spans Eurasia in a wide ark from Atlantic to Mongolia. See me, OK.

English me (pronoun) “1st pers. sing. accu.” (Sw N/A, F10, 1.18%) ~ Türkic min, ben, men, mən (pronoun) “me”. Ultimately fr. 1st pers. sing. pronoun bi- “I, me” with a possessive suffix -n. Male/fem. gender may differ by vowel. With the b-/m- alternation, me/be, min/bin, and men/ben belong to the same cluster, and the Türkic and others' men/ben is as good for “I” as it is for “me”. There is a semantic grey area between “I” and “me” in both languages that points to their interchangeability. In Türkic, min is regular, ik is a metaphoric extension. In English, both forms I and me have equal standing. Dative agglutination is preserved in meseems, methinks. Cognates: A.-Sax. oblique cases of I me (dative), me, mec, meh (acc.), OSax. mi, OFris. mi/mir, MDu. mi, Du. mij, ON, Goth. mik, OHG mih, Gmn. mich; OIr. me, Welsh mi “me”; Latv. manis (gen.), man (dat.), mani (acc.) me”; Lat. me; Gk. eme; Skt., Av. mam; Sum. me. The Balt. (Latv.) forms match the modern Turkmen (Oguz) forms of the personal pronouns “I” men: “me” menin (gen.), mena (dat.) “me”, meni (acc.). The Celtic forms attest the presence of the form me before they departed on their circum-Mediterranean migration in the 6th-5th mill. BC. The Sum. me confirms the presence of the form me in the 4th mill. BC. I and me are two members of a massive class of the personal pronouns carried to the A.-Sax. and on to English as a case of paradigmatic transfer, directly attesting their origin in the Türkic phylum. See I, my, un, us.

English my (pronoun) “possessive of me” (Sw N/A, F20, 0.87%) ~ Türkic suffix-m (possessive pronoun) “of me, mine”. Semantically identical, both pronouns indicate belonging to the 1st person sg. The Türkic suffix -m is also likely a contracted form of the suffix -min. The Türkic pronoun suffix -m moved to become an Eng. prefixed particle my, and the original pronoun suffix -m was dropped: аčïm (achym) > my ache (my ayk). The original form is preserved in the Gmc. cognates: OFris., OSax. OHG min, MDu., Du. mijn, Gmn. mein, ONorse minn, Goth. meins, eidetic to the Türkic pronoun min “me”; Ir., Gael. mo, Welsh 'm, fy. Welsh has retained my as an agglutinated grammatical suffix. The earliest recorded cognate is the Sum. me “me” attested from the 4th mill. BC. The form my is a contracted form of the Gmc. mine that has survived in the Eng. form mine. See me, un, us.

English she (pronoun) “3rd pers. sing. fem. personal” (Sw N/A, F50, 0.60%) ~ Türkic šu (shu) (pronoun) “this/that”. The Türkic šu is genderless. The A.-Sax. forms of the šu are masc. se, and fem. seo, the se/seo incorporated into A.-Sax. with the entire semantic bouquet of its Türkic sibling: personal, demonstrative, and relative pronoun, see he, this, that. The A.-Sax. ðat, ðaet “that” originated fr. the Türkic form tet, a derivative of the verb te- “tell, state”, it was used for neut. pronoun “it”, and as a conjunction “that, so that, in order that, after that, then, thence”, see that. Functionally, it was included in the triplet se, seo, ðaet. Etymologically, ðaet is erroneously confused with the unrelated personal pronouns se and seo. Cognates: A.-Sax. se, seo “he, she”, heo, hio “she” (semantics shifted to “he” by 13th c., see he), seo, sio “she” (after 13th c., fem. form of “this/that”). Shared across Gmc. group, she is cognate with all Gmc. cognates of the Türkic šu “this/that”, including the Du. zij, Gmn. sie, see this, that. The Eng. she is a result of diversification and specialization of the allophonic forms of the universal undifferentiated basic polysemantic šu to the allophonic se, seo, and on to she. The Balt. (Latv.) preserved supposedly archaic form šis (shis) of šu. The attested h- allophones heo, hio of seo are the reflexes ho/hu of the common Türkic šo/šu in the Aral-Caspian area. Instead of the original attested pronoun te-, the English linguists erroneously link the A.-Sax. ðat “this/that”, which tuned into the article the, with the form of fem. seo. The derivatives of šu “he, she”, and te-/tet/tes “the, that, this” are the most popular group in English, contributing, 0.90%, 3.21%, 2.16%, 1.04% , and, 0.66% frequency usage respectively, for a combined 7.98%, and bring the total frequency usage of Turkisms in English to about 41.65%. See he, the, that, this. (Update frequency table)

English that/this (pronoun) “adjectival demonstrative” (Sw8, F7, 1.96%) ~ Türkic tet- (v.) “oppose, balk at, differ”, tat (n.) “alien, opposite”. A group tan-/tän-, tet-, tat is a semantic cluster with a notion “opposite, opposing”, it echoes a directional element ta/da/te/de. These complementing forms forge an adjectival opposition “other”, “not us”, “other from this”, Cf. tat/dat “rust, altered metal”; Tat, Tatar “alien, stranger”. The Türkic pronoun bases te-, ti-, tu- “that” reflect that notion in a deverbal opposing adjectival derivative form. Widely dispersed, it has numerous allophones with t/d and a/ä/e/ǝ/i/ï alternations. At least in some languages it was articulated with an initial th/þ/δ/ð, not registered in Romanized transcriptions and mostly transcribed as as Roman t/d. The grapheme þ (th) was first registered in the runic legends of the Middle Asian Türkic coins, q.v. In the Türkic milieu, tet-/that served solely as an opposing pronoun “that”. A slight phonic modification from that to this with a native definitive suffix -s took place within the Gmc. milieu, but see this. It may have based on the version thit- of thet-. Grammatically, the derivative tet- is a passive form of the verb te- “tell” formed with an abstract suffix -t, Cf. assault fr. sal- “beat”. The literal tet- is an equivalent to the Eng. idiomatic told/said “above-named”, see tell. The tet-/that branch has produced too numerous Eng. offshoots to list: their, there, this, etc. The Eng. and Gmn. articles the and die/der/das/den are allophonic with the Türkic te-, and the Gmn. “that” jener/jenes is allophonic with a popular Türkic line an/ïn/un, articulated with an Oguric prosthetic anlaut j-. In addition to tet, Türkic has at least 6 other ways to express the demonstrative, as opposed to conjunctional, “that”: o/ol, so/šo/šu/sol/šol/šul/, an/ïn/un, and gol. One way or another, they left an enduring trail in the Gmc. languages. That listing is a key to the IE etymological interpretations. Cognates: A.-Sax. ðat (aka þæt), ðæt (pronounced that), OSax. that, OFris. thet, MDu., dat, Gmn. die/der/das/den; Balto-Sl. (Lith., OCS) to, ta, Sl. tot (òîò); Skt. ta-. These forms are specifically a demonstrative adjectival opposition “that”. The also cited Lat. talis “such”, and the Balt. (Latv.) šis (shis) may also be unlikely cognates. The Gmn. forms are conflated with the semantically contrasting adjectival demonstrative pronoun “this”, and also are used as articles and conjunctions, similarly to the Eng. “that” and “the”. The English “that” reportedly emerged ca.1200, a pure nonsense in light of the attested A.-Sax., OSax., OFris., and MDu. forms and the continued survival of the Middle Asian letter þ. The A.-Sax. ðat, ðaet was largely a pronoun and conjunction “that, so that, in order that, after that, then, thence”; semantic differentiation formed with amalgamation of local vernaculars. Etymologically, the directional that/this is erroneously conflated with the unrelated personal pronouns masc. se, fem. seo of the Türkic šu, probably due to overlapping semantics and use, see she. Signally, mirroring the past conflations, the IE-centered etymology piles Gmc. pronouns into one mushy heap. It postulates an obviously non-IE PG “proto-word” þat, that with gooey semantics in a slightly peculiar articulation of the attested and widely used Türkic tat. Apparently to cover the Sl. cognates, the phantom PG “proto-word” is further interpolated to an IE “proto-word” tod-. Apotheosis is the fabricated an “extended form of demonstrative pronominal base *-to-”, a distortion of the attested Türkic verb te- that grew into numerous Eurasian Türkic and Northern European lineages. The case of the paradigmatic transfer between the Türkic phylum and the nearly exclusively Northwestern European Gmc. and Balto-Sl. languages is demonstrated by the quartet of the stems tet, te, šu, and an (i.e. jener) attesting to a common demographic and linguistic genetic provenance. The specificity of the form tet carries a precious diagnostic value, to complement the diagnostics of the genetic and archeological disciplines. The quintet of demonstrative pronoun/article derivatives of te-/tet/tes is a most popular cluster in English, contributing a combined 6.20% frequency usage even without accounting for declined forms like them and their: “the” (3.03%, rating 4), “that” (2.02%, rating 7, 55), “this” (0.98%, rating 14)”, “these” (0.09%, rating 182) and “those” (0.08%, rating 200). That brings the total frequency usage of Turkisms in English to exceed 40%. See he, she, this.

English this (pronoun) “adjectival demonstrative” (Sw7, F14, 1.02%) ~ Türkic teš-, deš- (v.) “dialogize, discourse (in dialogue)”. Teš- is a reciprocal form of the verb te- “tell, state” formed with a suffix , see that. The semantic extension to the notion “this” vs. “that” originates in the contrast of the forms teš- “acting together, us” vs. the adjectival opposition tet- “other, not us”, “not this”. Both verbs are derivatives fr. the verb te- “tell”, see that. An alternate, IE-centered etymology suggests a slight phonic modification from that to this with a native definitive suffix -s that took place within the Gmc. milieu. It may have based on the version thit- of thet-, see that. The literal tet- is an exact equivalent to the Eng. idiomatic told/said “above-named” (Cf. “that, not this...” and “above-said person, event”, etc.). The literal teš- is a reciprocal equivalent of the same (Cf. “this, not that...”, “this particular...”, etc.). It is fairly clear that the demonstrative pronoun teš “this” could only form on the model of tet “that” after the tet gained a local circulation as an opposing demonstrative pronoun, Cf. Sl. semantic extension tot (òîò, “that”) to etot (ýòîò, “this”, a local innovation, on the model of “periodic/aperiodic”). In addition to teš, Türkic has at least 3 ways to express the demonstrative “this”: bu/bo/pu/po, ko, sa/so/šo/šu/šï. English, in contrast, has only one semi-synonym, such, and that is an allophone of the Türkic sa/so with adjectival -lïɣ “like”, Cf. A.-Sax. swilc “such, like this”. Cognates: A.-Sax. ðas, ðaes “this”, OSax. these, OFris. this, ONorse þessi, Icl. þetta, MDu. dese, Du. deze, OHG deser, Gmn. dieser, das; Balto-Sl. (Lith.) tai, (Ru.) etot, Cz. tato, tyto, Pol. to, Bulg. toya; Ch. 此 cǐ (che). Across Europe and far beyond float numerous semantic inversions: allophones of Türkic synonymous šu “this/that” are retained as “this” in the Celtic (Ir., Gaelic) seo; Balto-Sl. (Latv.) ši, šis, (Sl.) sey; Arm. sae; Est. see, Fin. se; Ch. 此 cǐ (che), see he. The Gmn. article das, a counterpart of the Eng. the, transparently ascends to the Türkic allophone deš- “this”, and ultimately to the notion “tell”. The Celtic seo points to its priority relative to the versions of te-, it indicates a separate migration path from the apparently later synonymous pronouns of te-. The Ch. form may equally hail fr. the Türkic šu “this/that” and te- “tell, state”. The distribution across Eurasia and across linguistic families clearly exceeds the boundaries of the IE-cloistered etymology, which is bound to no cognates outside Gmc. The distribution is certainly peculiar, limited to the northern Eurasian arch plus a Chinese tongue. The IE-centered etymology equates “this” and “that”, and postulates an obviously non-IE PG “proto-word” þat, that with mushy semantics and slightly peculiar articulation of the widely attested and used Türkic tat “alien, foreign”. The phantom PG “proto-word” is interpolated to an IE “proto-word” tod-, apparently to cover the Sl. cognates like tot (òîò) “that” (not “this”). In the apotheosis, the IE etymology devised an “extended form of demonstrative pronominal base *-to-”, which is obviously a slight distortion of the attested Türkic verb te-. Finally, the IE-centered etymology ascends *-to- to an “PIE” super-proto-word *so “this, that”, which is obviously a slight variation of the attested Türkic šu “this/that”, see he, she. The last leap fr. a phantom *-to-” to a phantom *so is apparently driven by the attested A.-Sax. se “he”, seo “she”, ðaet “it, that, this” ascribed to a single super-proto-word, instead of the distinct and attested Türkic šo/šu, its -h- allophone ho/hu from the Aral-Caspian area, and the te- for undifferentiated “it, that, this”. The asterisked phantoms stop being fantasms the second they are replaced with the Türkic regular and attested words. Two more English pronouns have developed from teš “this”, these and those. The sheer number of Türkic precursors provides a striking evidence of a paradigmatic transfer case. Five pronoun/article derivatives of te-/tet/tes, the (3.03%, rating 4), that (2.02%, rating 7, 55), this (0.98%, rating 14)”, these (0.09%, rating 182) and those (0.08%, rating 200) are a most popular group in English. They contribute combined 6.20% frequency usage without accounting for declined forms like them and their. That brings the total frequency usage of Turkisms in English to reach over 40%. See he, that

English us (pronoun) “personal accusative and dative plural of we” (Sw N/A, F 104, 0.19%) ~ Türkic ös/öz (üs/üz) (pronoun) “self (we, us, selves)” (OTD p. 394). The generic notion of ös/öz/üs/üz is “inner, core”, declined as any other noun, it is synonymous with the generic notion of (Gmc. ik, ich) “inner” that stands for sing. “I”, see I. Use of synonyms probably developed to avoid confusion, a blurring synonym went extinct. With appropriate suffixes, ös forms notions “me/us, you”, and “he/she/they”, eidetic with the notion that forms the notions “I, myself”. The preserved semantics of the ös/öz/üs/üz derivatives suggests that at one time both synonyms were used interchangeably to form a universal line of personal pronouns, and that specialization occurred at amalgamation with alien languages. The Gmc. group settled on the semantics “us”. That is corroborated by the pronoun “you, yours”, a derivative of the pronoun suffix -üŋ, -uŋ. Cognates: A.-Sax. us, OSax., OFris. us, ONorse, Sw. oss; Sum. ez. The -n- form was active in Gmc., Celtic, Lat., Gk., Sl., Skt., and Hitt. languages: Du. ons, Gmn. uns; OIr. ni, Welsh ni “we, us”; Lat. nos “we, us”; Gk. no “we two”; OCS ny “us," nasu “our”; Skt. nas, Av. na; Hitt. nash “us”. The IE etymology conflates the -s- and -n- forms into a dubious unattested phantom “proto-word” *ns. Distribution of the allophones of the -s- and -n- in Europe, Asia and across Eurasia is consistent with N.Pontic serving as a sanctuary for European refugees from the carnage inflicted during the 3rd mill. BC on the old European farming populations marked by Y-DNA haplogroups G2a, E1b-V13, I1, I2, and R1a. From the N.Pontic started a forked migration of the peculiar -n- form to the south-central Asia (Skt.) and comeback to Europe (Lat., Gk.). The OIr. and Welsh -n- forms point to the presence of the -n- form in the N.Pontic as early as the start of the Celtic circum-Mediterranean migration in the 6th-5th mill. BC. The Sum. ez attests the 4th mill. BC timing of -s- form. See I, me, my, un, you.

English we (pronoun) “1st pers. pl.” (Sw4, F29, 0.81%) ~ Türkic biz, bez (pronoun) “1st pers. pl., we”. Ultimately, biz is a copula, the notion “we” is a functional extension forming dual plural (Cf. biz (ikki) “dual, copula”, and Goth., ONorse vit, OHG wiz “dual we”), which further expanded to inclusive plural. In Türkic daughter and affiliated languages b- may take a range of qualities Romanized as p-, v-, and w-, with further variations. Cognates: A.-Sax. we, OSax. wi , OFris. wi, OHG wir, ONorse ver, Goth. veis (weis), Dan. vi, Du. wij; OIr. mu-id (-id plural), Welsh ni; Balto-Sl. (Lith.) mes, ve(du) “we (two)”, (Latv.) mes, (OPruss.) mes, (OCS) my (ìû) “we”, ve “we (two)”; Skt. va(yam); Av. va(em), OPers. vay(am); Hitt. wesh; Sum. me; Hu. mi, min-k; Est. me, Fin. me; Arm. måk; Ch. wo; Jap. wa-; Mong. bid (áèä). The sampling of cognates across Eurasia demonstrates that the European and Asian IE forms are related, each with its own flavor, but are a particular subset of the underlying forms split along the m/b divide that crosses the entire IE linguistic family. The m/b split and co-existence extend across the eastern and western parts of the Eurasia. The European Romance group has its own independent form: Lat. nobis, Fr. nous, etc., Cf. Sl. nash (íàø) “ours”. The oldest on record is the Sum. form, leading to a proposition that the m- form was a prime form, which separated into the m- and b- lines, with the b- line diffusing into areal b-/v-/w- variations. That is corroborated by the OIr. m- form. Both the Sum. and OIr. -m forms are datable, the Sum. form by the appearance of writing, and the Celtic form by the time of the Celtic departure from the E. Europe, by the 4th mill. BC. Other diagnostic traits are the inclusivity vs. exclusivity of the word, the presence of the dual form, the interrelation between the notion “we” and the notions “I/me” and “you”. The n- forms may putatively ascend to the Old Europe vernaculars of the 4th mill. BC, prior to the period of the “killing fields”. The major overlap between the Gmc. and Türkic personal pronouns shown in Table 5 provides a major attestation of the paradigmatic transfer case, and the indisputable evidence of the common origin. See are, I, he, she, us, you.

English you (pronoun) “2nd pers. sing. and pl.” (Sw2, F1, 6.21%) ~ Türkic -üŋ, -uŋ (pronoun suffix) “2nd pers. sing. and pl., and respectful”. The suffix initial -ü/-u is most typical, but can be ï/i/a/ä and more in consonance with the phonetics of the base root. The auslaut ŋ is most typical, but can be g/ɣ/q and more depending on the application: verbal, nominal, declination, and conjugation, and the component -üŋ- participates in many of them. The wealth of the different forms, packaged with and between other agglutinated suffixes and in unrelated roles, is nearly incomprehensible for the alien ears. A most prominent phoneme -ü-/-u- had been internalized and reshaped into the native grammars and articulations, and then codified into the native scripts. The Türkic suffix is semantically exactly the same as the detached English pronoun. In English, by 1450s the 2nd pers. sing. respectful became a general norm, and the form thou gained connotation of disrespect or intimacy. If not earlier as a Sprachbund norm, a loss of the nasal consonant and a transition from a suffix to a separate word occurred in transition to an alien phonetics and syntax. The most basic personal pronouns I, you, he were internalized in their entirety and verbatim. More complicated forms were internalized partially, some became indiscriminant 3rd pers. generic (he + she + they), some may have retained a native base, and others have not survived in the host languages. Dual plural was partially absorbed but expanded to a generic “both”, breaking the original pinpointed connection with pared objects like the pairs of ears. Cognates: A.-Sax. ge, git (dual), eow, OEng. ye, ge, OSax. iu, ye, OFris. iuwe, ONorse yor, MDu. ghi, Du. u, gij, OHG iu, iuwih, German euch; Lith. jus; Hu. ön; Lat. vos, Fr. vous, Sp. usted, usteres; Gk. hymeis; Sl. vy (âû); Skt. yuyam, Av. yuzem, Fin. sinua. Saliently, most cited European cognates carried the Türkic-type indiscrimination between 2nd pers. sing. and pl., a unique trait of paradigmatic semantic transfer. Essentially, there are no citable IE cognates from other IE branches. The faux “PIE” *yu is unwittingly an allophone of the attested Türkic . An idea that the Corded Ware Kurganians spoke an IE language and the European people of the Corded Ware spoke some unknown non-European languages is beyond absurd. In addition to the Eurasian Türkic languages, the distribution is limited to the European zone and its vicinities, crossing linguistic family boundaries. The Lat. and Fr. sport a prosthetic initial v-, also an earmark of the Sl. languages. Outside of the literary usage in Eng., and in daily use elsewhere, the pronoun thou, a detached cognate of the Türkic past tense 3rd pers. suffix ti, continued to be used till present, another case of paradigmatic transfer. In light of the 2015 discoveries on the genetic composition of the Corded Ware, a reasonable suggestion would be that the -üŋ > ü transition happened at the Corded Ware stage, and the Corded Ware inaugurated the pidgin of a blend of the Turkic and Old Europe vernaculars that lost agglutination and recycled a number of Turkic suffixes as prepositions that grew to independent words. That was the first draft of the “IE” family. The use off ü for both 2nd pers. sing. and pl. presents a paradigmatic transfer case shared by a number of the European descendents of the Corded Ware period.

Idioms (Examples found in etymological materials)

Numerous catch phrases are inherited from the pre-literate world, they are idioms peculiar to native languages, often untranslatable literally. Some idioms can be traced to the millenniums-old first literary works, and they are still with us. Few of them even get into etymological works, where they are translated and cited as etymological examples, and viola, some translations look like phonetic distortions of the regular catch phrases. In cases when a word does not have a single synonym, a translation is literally forced to use a cognate word, defeating any biases. In the literate era, these idioms are internationalized by the spread of literature.

English tit for tat “equivalent pain given in return” ~ Türkic tite tit, lit. “pain for pain”, an authentic Türkic idiom. The IE etymology does not have a sensible answer for this English expression, or a comic “nippel for a touch”, it is rated as a mystery. See tooth for tooth.

English tooth for tooth “equivalent given in return” ~ Türkic tiše tiš (tishe tish, Turkish diše diš (dishe dish), lit. “tooth for tooth”, a Türkic idiom. See tit for tat.

English eye for an eye “equivalent given in return” ~ Türkic közasa közas, lit. “eye for an eye”.

English (see) eye to eye “to be in agreement” ~ Türkic göz göze, Turkish köz köze lit. “gaze to gaze” (Russian calque s glazu na glaz, lit. “from eye to eye”).

English flea market “street market” ~ calque of the Türkic bit bazary “flea market”. Gmn. der Lausemarkt, Fr. marché aux puces.

A.-Sax. eorðscrafu “earth cave” ~ easily recognizable Türkic compound yerkaba, lit. “earth cave”. A.-Sax. is full of such compounds, Cf. modorcynn “maternal + kin” ~ Türkic mamü + kün, orceard “garden, lit. dug up yard” ~ Türkic or-, ore “dig, dug (ground)” + yer “earth, land”, steopsunu “step-son” ~ Türkic tep- “step” + song “son”.

See also examples in the section Phrases, q.v.: charcoal, greyhound, tooth ache, God given, I (am) sick, acidic ache, don on me, old geezer, tall (and slim) body. For any distinct semantics, every additional phoneme added to a string reduces chances for random coincidence by few orders of magnitude. Beyond 5- or 6-phoneme strings, a possibility becomes impossibility. A chance coincidence of two or more strings is infinitesimally small.

2. Morphology (comparing a few of English and Türkic morphological elements)

English -an (suff. pl.) ~ Türkic suffix -an (suff. pl.): erän “men”, oɣlan “children”, örtän “flames”. Both Türkic and English denote plurality of objects or subjects, defined in English as “weak nouns” category because they used the -an suffix. The A.-Sax. plural suffixes -u and -an are not active any more, victims of continued creolization, they were replaced with the plural marker -s, which has been extended to singulars in the old collective sense formerly rendered with the suffix -an: babes, sweets. See -s.

English at (prep.) “reference to position, direction, location, or time” (Sw201, F59, 0.35%) ~ Türkic at- (v.) basic notion “throw, shoot”, with a very wide range of extended and metaphoric meanings including directional, wide variation in the cases of the direct and indirect object, in some languages at- has almost become an auxiliary verb: daŋ (dang) at “at the dawn”, like its usage in A.-Sax. and English. Cognates: A.-Sax. aet (ət), ONorse, Goth. at, OFris. et, OHG az, Sw. at, Du. te, Afrikaans teen; Lat. ad, Romance a; Skt. adhi “near”, Gujarati ante; Ir. ag, Est. -ga, Az. -ed, Tr. -de, -da. English uses both -t and -d forms, many of them are obvious Turkism-based phrases: adapt < ad + apt (see aptitude), adage < ad + age (see age), adequate < ad + equate (see equate), and so on. The Romance a may be a relict of Burgundian Sarmat systemic truncation. Agglutinated Est. -ga may or may not be a form of Türkic -de/-da. Another European word for the directional “at” is i, used in spots by diverse languages, it may ascend to the Balkan Sprachbund of the Old Europe (e.g. Arm, Icl., Lith.). The simultaneous presence of the form at/ad in the IE languages of Europe and Asia attests to its presence in the N. Pontic as a directional auxiliary verb prior to the Indo-Aryan 2000 BC migration to the Indian subcontinent. See and, age, at, aptitude, be, equate, dawn, in, on 1, on 2, till (prep.), to.

English com- (prefix) “with, together” ~ Türkic qon-, ko:n- (v.) denoting cooperative action “with, together”. Ultimately from a polysemantic stem qo-/ko:- denoting “all, together” and “stopover”. Among other meanings, qon-/ko:n- expresses generic “to stop”, “to stay” (from sleepover to dwell) and “bunch together (stopover)”. The semantic component of “bunch together” makes qon-/ko:n- a precursor of the European com- “with, together”, its antecedent con- (contrary to claim  of a derivative), co- (coact, coaxial, etc.). That component is illustrated by allophones and derivatives: active deverbal noun konat (konot, qonat, qonot) fr. ko:n- “cumulate, bunch together, companions”, kontur- (v.) “settle, lodge (together)”, i.e. “cohabit”, konum (n.) “a group of people living close together”, i.e. “cohabit”, kop/kob “all”, koš- “conjoin, unite (two things), together”. Although the element qo-/ko:- did not survive as a stand-alone word, its presence in the derivatives is well attested. The complementary Türkic forms with -n/-m assimilation (ki:m/gi:m/kem/kemi/kemi) create a credible link to real-based etymology, offering a viable prototype notion. The Türkic etymology is necessary to demonstrate the phonetic and semantic congruity in the Eng., Lat., and Türkic languages. Cognates: Mong. qono- “to stay”, xonača “guest”, Khalk. xonoč “guest”. The IE etymology stops at Lat. as though Lat. lays at the root of the IE languages, a totally indefensible presumption based on its very late (turn of the eras) dominant status throughout the western Roman Empire. The non-attested, imagined “reconstructed PIE proto-word” *kom- is unwittingly eidetic with the documented Türkic form. The IE etymology is notable for its circular manner, it offers wysiwyg “what you see is what you get” as a prehistory, with some digressions into customary usage. The European prefix com- is a loanword within the European languages, that is attested by its limited and peculiar distribution covering an insignificant number of the IE languages. That makes the claim of a “PIE proto-word” totally incredulous. In Europe, the prefix com- and its versions blossomed into a major word-forming component, occupying a large place in the European vocabularies, a list of cognates across languages would take volumes. Many have became international words, like cooperation and convulsion. A fair number combine the Türkic qon-, ko:n- with another Türkic word, like combat (com- + bat/pat) and coapt (com- + apt). Neither A.-Sax. nor Gothic have recorded traces of Türkic qon-, ko:n-, so a path via OFr. or Lat. may be expected. In the European languages a separate word in the Türkic syntax turned into preposition (Cf. cum laude) and prefix (Cf. concubine, lit. “with stranger”). A status of preposition within Türkic grammar would seamlessly transit to the status of prefix within the IE grammar. The phonetic and semantic concordance expands the short-circuited IE etymology into the world of real linguistic processes. See con-.

English con- (prefix) “with, together” ~ Türkic qon-, ko:n- (v.) denoting cooperative action “with, together”. A sibling of com-, see com-. Ultimately from a polysemantic stem qo-/ko:- denoting “all, together” and “stopover”, among other meanings, qon-/ko:n- expresses generic “to stop”, “to stay” (from sleepover to dwell) and “bunch together (stopover)”. A derivative form konat indicates a prior existence of a complementary noun, preposition, or clitic (proclitic) con (kon, qon) conveying a notion “with, together”. Neither the A.-Sax. nor Gothic used that element, English acquired it from Lat., possibly via Norman French, thus the prefix con- is an adstrate rather than substrate, con- is assimilated form of Lat. com- . Its path via Lat. has bypassed the Gmc. phylum, attesting to an independent route and pointing to dialectally independent intrusion. See com-.

English en- (prefix) “make, turn into, put into” ~ Türkic eŋ (preposition service word) “in the beginning, first of all”, The Türkic with its allophones developed into the Lat. in- “in, into” and Gk. en “in”, OFr., Fr en- “in”. In Türkic and in English, em- is used before a p (emplace, proclitic) or b (embrace, proclitic). In Türkic and in English, en- is homophonous with in- and an-, at times they are confused, at times conflated, at times misinterpreted, adding a much wider spectrum of semantics. It is a formant for the aspect of initial time in A.-Sax. and then in English, with a notion “to engage”: A.-Sax. anaegilan “to nail”, anaeðelian “to degrade”, “anbidian” “to expect”, Eng. enchant “engage in chanting”, encharge “engage in charging”, enact “engage in action”, encamp, enrage, enable, encage, entail, etc. The IE etymology does not have any sensible explanation to its origin, the en- just appeared from nowhere, and etymologists only report the status, referring to homophones like Lat. in- “in”, Gk. en- “in”. The prefix en- should not be confused with a preposition en, an allophone of the Türkic en- (v., prep.) “descend, come down, sloping down, inclined down” used in idioms “en route”, “en rule”, “en suite”, “en vogue”, etc. Those came to Eng. via Fr. Semantically, the roles of the Türkic , Eng. en-, and A.-Sax. an- and en- are the same. In A.-Sax. en- and an- stand for “and”, “un-”, “in-”, and “on-”, with most lexicon predating Lat. influence. Similar disarray had occurred with other European languages, pointing to dialectal phonetic and semantic variations of the adopted sources. Cases of vague semantics are rather exceptions, like the insure and ensure. Undoubtedly, the Gk. and Lat. were instrumental in spreading across the European languages, but neither Gk. nor Lat. originated it nor borrowed it from A.-Sax. It is tempting to ascribe it's origin to the Sarmats of the 2nd c. BC, but the roots may extend much deeper. The eastern IE languages do not share this prefix, in Hindi and Urdu the role of the European en- is performed by particles da- and daa-, the allophones of the Türkic directional suffix -ta/da-. That makes it impossible to claim a common origin with the European languages. That sets the upper limit on the dating as post-2000 BC, the earlier date of the Indo-Arian outmigration. What unites the Türkic, English, Lat. and Gk. splits the IE paradigm. The Türkic etymology offers semantic explanation for en/en-, on/on- and in/in-, with further extensions to derivative adverbs and adjectives. The trio belongs to the mass case of morphological paradigmatic transfer from Türkic to English. See in, on.

English en- (prefix, prepos.) “make, turn into, put into” ~ Türkic en- (v., prepos.) “descend, come down, sloping down, inclined down” In Türkic and in English, em- is used before a p (emplace, proclitic) or b (embrace, proclitic). In Türkic and in English, en- is homophonous with in- and an-, at times they are confused, at times conflated, at times misinterpreted, adding a much wider spectrum of semantics. In A.-Sax. en- and an- stand for “and”, “un-”, “in-”, and “on-”, with most lexicon predating Lat. influence. Similar disarray had occurred with other European languages, pointing to dialectal phonetic and semantic variations of the adopted sources. Cognates: OFr. en-, Lat. in- “in, into”. The IE etymology does not have any sensible explanation to its origin, the en- just appeared from nowhere, and etymologists only report the status, referring to homophones like Lat. in- “in”, Gk. en- “in”. Semantically, these homophones are incompatible, one being “make, turn, put”, vs. the other “be in, toward inside”. Cases of vague semantics are rather exceptions, like the insure and ensure. The Türkic etymology offers semantic explanation for en-, on (prep.) and in (prep.), with further extensions to derivative adverbs and adjectives. See in, on.

English -er (suffix) “man, agent noun”~ Türkic er/ir/ar/härä (n.) “man, warrior, sire”. In Türkic it also serves as a grammatical postposition identical to Eng. suffix -er/-or and -man (e.g. teacher, craftsman), Cf. bošɣuɣčï er “teacher”, udčïer “shepherd”, kåjikei er “hunter”, qapïɣčï er “gatekeeper” (OTD 175). A difference between postposition and suffix here is purely conventional. The Eng. suffix indicates a human, an agent noun: teacher, butcher etc., from the Türkic root er/ir “man”. Eng. semantics extends to a general instrumental suffix: stapler, machinery. In the modern times, it grew to a global spread, Cf. computer etc. As an agent noun suffix it is widespread in Europe, with allophones -or/-ar/-ir etc., Cf. Lat. -or. In the original sense of powerfulness, might, superiority, it lives in the prefix ar-: arch-, archi-, archon, architect, etc. And in the exonym German “manly” or rather “very manly” fr. the Türkic erman “very manly” where -man is an emphatic suffix. The noun comes with at least ten semantic clusters and an astonishing variety of allophones, from ar to ey and from härä to herr and wer, attesting to deep antiquity and a rainbow of receiving languages from Atlantic to Pacific. Cognates: A.-Sax. wer “man”, here “troop, army”, ONorthumbrian -are; Dan. -ere, Sw. -are, Gmn. -er, herr; OGk. ‘ερως (eros) “hero”; Scythian (5th c. BC) eor “man”; Mong. ere “man, male”, MMong. ere “army”, Dagur er “male adult, husband, true man”; Tungus ur “man, male, sire”; Manchu erke “manly”, Jurchen ere “patriarch”; Ch. err 兒 “male child, boy” (N.Bichurin, “Collection”, Vol.1, p. 46, Note 3). Distribution of the word is truly Eurasian, from Atlantic to Pacific. In Eng. the suffix is generally used with native Gmc. words, in Sl. languages it became -el after verbal infinitive suffix -t-: vodit (âîäèòü) “lead” > voditel (âîäèòåëü) “leader”. At the Herodotus' time, Scythians called their man “er”, cited in the word Eorpata (Οιορπατα) with a part eor “man” (Herodotus IV 110). The Scythian form pata “strike” and its Sumerian precursor bir-, ber- also survived in English as the word bat. The Sumerian form bir-/ber- is attested fr. the 3rd mill. BC. The Scythian phonetic form rendered as eor reflects the Ogur articulation yer/yir/yar with a prosthetic y-/j- in the anlaut, rather than the Oguz form er/ir/ar. The Herodotus' form eor is probably a reasonable rendition of the form wer, and the name Sax, Saxon is a reflex of the ethnonym S'k “Scyth, Sak”. See bat.

English -ish (adj. suffix) ~ Türkic -čà/-čä (-cha/-che) (adj. suffix). Both Türkic and English suffixes form adverbs, adjectives, Eng. small > smallish, Tr. kičig (kichig) > kichigčà, Eng. Turk > Turkish, Türkic Türk > Türkčä. Cognates: A.-Sax. (OE) -isc, ONorse -iskr, Gmn. -isch, Goth. -isks; Gk. -iskos. This suffix is absent in other IE languages, except the Slavic-Russian, which retained exactly both phonetic and morphologic function of the Türkic -čà/-čä: e.g. logics - logical logica - logichno (adv.) logichnyi,-en (adj.) (ëîãèêà > ëîãè÷íî, ëîãè÷íûé, -åí). The Gk. allophonic form and function is just another Gk. adoption or retention of the Türkic linguistic elements. If not earlier, it could be spread across Gk. vernaculars by the N. Pontic half-Greek, half-Scythian speakers (Herodotus 4.108).

English -like (suffix) “like” ~ Türkic -lig/-lïɣ/-laɣ (suffix) “like”. The A.-Sax. suffix -lic is an exact twin of the Türkic suffix -lig. Like the English -like (antlike, beelike, etc.) the Türkic -lig is agglutinated to the stem to express a notion of similarity or possession of some property or quality: “cheek”, eŋlig “cheeky”; erk “power”, erklig (“powerful”; teŋlig “measured-like (manner)”, tepizlig “marsh-like”, etc.). A Türkic allophonic suffix is -laju/-läjü (phonetically -laü) “like”, e.g. adïɣlaju “bear-like”. Apparently, in -laju the auslaut consonant was truncated, replaced with semi-consonant, typical for the western Ogur languages. The A.-Sax. reduced the wealth of the Türkic like-type forms articulated with front and rear consonants and vowels and used according to the vowel harmony rules and grammatical conventions (-liğ/-lığ/-luğ/-lüg, -lik/-lık/-luk/ -lük, -leč/-la:č/-lıč) to a single form -lic, simplifying grammar, semantics, and phonetics. That is typical for creoles. A popular suffix, Eng. has about 750 compounds with the -like. Cognates: A.-Sax. -lic, gelic “like, similar”, -lic appears about 1100 times representing about 4% of the Clark-Hall 2011, Anglo-Saxon Dictionary (about 20,000 words); OSax. gilik, OSw. gilik, Norse glikr, Du. gelijk, Gmn. gleich, Goth. galeiks “equally, like”. Cognates are built on the Türkic model with numerous synonymic expressions for the notion “like”: îsuɣluɣ “like”, jölästürgülüg “compared, resembling”, ančulaju “such, resembling”, munčulaju “such, resembling”, and more, they convey the adjectival notion “like” formed from disparate roots by agglutinating allophones of the suffix -lig. The A.-Sax. and its daughter languages (OE, ME, Eng.) retained the functional suffix -lig/-like and extended its function to different grammatical functions of noun, verb, adj, and adv. The prefix ge-/ga-, initially “with, together”, forms past participles. It is irrelevant to the suffix -like and its allophones. The ubiquitous presence of the allophonous prefix ge- in the Gmc. languages attests that these languages at one time formed their own Sprachbund with distinct shared morphological features. Both in Türkic and English, the compound form serves as a noun- or verb-derived adjective. No IE cognates whatsoever, cognates are confined only to the Gmc. branch. The phonetic, semantic, syntactic, and morphologic similarity is persuasive, in contrast to the artificial and superficial IE attempts to derive the English -like from the homophonic (lic, lich) but totally unrelated “body”, “corpse”. Such intractable suppositions heighten doubts about credibility of entire IE paradigm. The element -like is a salient member of the substantial body of morphological elements constituting a massive case of paradigmatic transfer that vigorously attests to its Türkic origin. Not formally accounted against a frequency usage of the English Turkisms, they still add mightily to the proportion of the Türkic inheritance in English, Cf. 4% of the Anglo-Saxon vocabulary, probably much less in usage frequency contribution.

English omni- (prefix) “all, everywhere” ~ Türkic omqï, omqu, umax, yimaɣ (adj.) “all”, lit. “collected, gathered”. Ultimately a derivative of om-, yom-, um-, yum- “collect, gather” formally rated as long-dead “Proto-Türkic”, but richly furnished with presently active derivatives. The out of circulation word omqï/umqï and its allophones (yomqï/yumqï in Ogur articulation) is an archaic word recorded in the OT and early Middle Türkic periods. The omni-/omqu and total/tutuš share their fate of both being the Türkic relicts, both came to Europe with the Latin predecessors. During the Middle Ages, on the British Isles, they rejoined their originally Türkic A.-Sax. synonyms. On top of the somewhat murky Türkic archaic inheritance, the trace of the Lat. relict allows to gain a view of the Proto-Classic processes. Eng. has about 100 compounds with omni-. Cognates: Lat. omni-, omnis “all, every, whole, every kind”, rated “of unknown origin”, but with a pedigree of the modern international word disseminated by the modern science. The A.-Sax. composite ymbcyme, ymcyme “assembly, convention” with cyme “come” may reflect the Türkic om-“collect, gather”, lit. “come to gathering”. The IE etymology came up with a speculation of ascending to the notion “work, produce”, based exclusively on an alleged phonetic consonance, quite a desperate proposition. In contrast, the Türkic-based etymology is supported by direct phonetic and literal semantic correspondence. All in all, English is using five of the Türkic 32 ways to say “all”. That is quite an impressive paradigmatic transfer. See all, entire, gamut, total.

English on 1 (prep., postp., adj., Norse adv.) “on top of” (Sw N/A, F26, 0.73%) ~ Türkic ö:n, öŋ (n.) “top, front (frontal part)”. Ultimately a semantic extension of the noun ö:n “front, forepart, frontal, beginning”. A connection is, for example, in the perception that the top of a well is its front. Besides “top, front”, the polysemantic öŋ carries another eight semantic branches with their own derivatives, e.g. öŋ “before, previously”, see once. With a signal economy of material, Türkic has a complementing contrasting cousin in/en (n.) “bottom, descent”, see in. It is reasonable to admit that öŋ and in/en are coevals from the same archaic period. Except for few Türkic languages, the notion “top” was largely supplanted by the words jer, orü, üst, üzä, and forms with directional suffix -ta/-da, Cf. Türkic onda “there, then”, tepede “on the top” (töpü “top”), Cf. Eng. onto “to on, on top of”. The ö:n/öŋ lives on in the derivatives, Cf. öndün “ahead, in front, forward, advanced”. In Eng., the fuzzy and fuzzily differentiated A.-Sax. on “on, at, in, to, for” was widely supplemented by a cousin form “in”. Like with in/en, the Eng. on is a universal preposition, postposition, adverb, and adjective, with a wide spectrum of semantic meanings: about, along, during, in, upon, with, active, etc., something activated or coming along. In some applications, a blurry semantics of both on and in overlaps, confusing a grammarian and homegrown etymological specialists. The process of speciation continued well into the 2nd mill. AD. The A.-Sax. on carries on the semantics of the Türkic ö:n “top, front, beginning”, but he semantic and grammatic haziness are symptomatic for the entire Gmc. group. Cognates: A.-Sax. aen, an, on (prep.) “on, upon, during, at, about, against, towards, according to”, (adv.) “forward, onward, continuously”, (prefix) aen(laenan), on(laenan) “lend, grant”, onbaec “backwards, back”, Du. aan, Gmn. an, Goth. ana “on, upon”; Lat. an-; Gk. ana (ανα) “on, top”, ano (ανω “up, top”); Av. ana “on”; OCS na “on, upon”, nai- (prefix) “top”; Mong. ӧnge “frontal part”, ӧmïne, emüne “front (frontal part)”. Allophones of the forms on and pa (Sum. ba “in”) predominate across Eurasia. Location of the ba- is peculiar: Dan., Sw. pa, Fris. op, Fin. päälle, Hu. tovabb, Rum. pe. The Türkic form ö:n “top” has survived in the Uigur and Uzbek languages of the Karluk group. The directional on, in, and at are notoriously confused in daily life, vernacularly adding to the semantic spectrum. A direct comparison of cognates is etymologically muddled, similar phonetic forms are semantically conflated, and any type of movement in any direction is thrown into a same phonetic pile with little attention to its key message. There is no “IE” sensible explanation for the origin of on. The ideas of the faux mechanical constructs “PIE proto-root” *h2en- and/or “PG” *ana “on” are dubious and not needed in light of the attested Türkic originals. These falsifications need to be rescinded in respect to the principals of science. That leaves no “IE” etymology, the word came from nowhere, both grammarians and curtailed-purview etymologists act as confused observers. A similar IE etymological cacophony surrounds the origin of in (prep.). The Türkic etymology offers semantic explanation for both on (prep.) and in (prep.), with further extensions to adv.s and adj.s. Both prepositions have clear and discrete semantic meanings, i.e. aiming to a higher status vs. aiming to a lower/inside status. An absence of the Celtic cognates and the Gmc. convergence of the European cognates attest to the absence of the word at the time of the Celtic departure from the N. Pontic ca 5000- 4000 BC. The presence of the word in the Av. attests to its European presence at the time of the Indo-Aryan departure ca 2000 BC. That points to the Corded Ware hub as a source of the European and South-Central Asian internalization. A healthy half-dozen of the Eng. directional prepositions irrefutably lead their origin from the Türkic milieu. The Türkic origin is the only realistic outcome. See and, at, be, in, on 2, till (prep.), to.

English -s/-es (suffix pl.) ~ Türkic suffix -z (suffix pl.), Chuvash -sem. Both Türkic and English suffixes denote plurality of objects or subjects and marks possessive case, see -'s possessive”. The Türkic -z is archaic and now is present only in some words, like I vs. we, but is still most active coupled with other suffixes or particles: biz, -ŋïz/-ïŋïz, -dïmïz/-tïmïz, etc. Other archaic Türkic plural markers are -t/-ty, and -an (-lan) denoting collectivity: bajaɣut “affluents”, oglan “boys” Cf. cognate clan. Cognates: A.-Sax. -as: dæg “day" ~ dagas “days”; Du. -s plurals, Scandinavian -r plurals (rhotacism): Sw. dagar “days”. In Türkic, rhotacism is connected with the Ogur (Western) languages: Scythian, Sarmatian, Hunnic, Bulgar, Tatar, Halaj/Alat, etc. Under the IE paradigm, the Gmc. –s/-r declension is a peculiarity traceable to the original PIE inflection system. It is a pure nonsense given its Eurasiatic spread reaching from Atlantic to Pacific, and sporadic nature among IE members. Both Türkic and English also denote plurality of objects or subjects with the suffix -an, defined in English as “weak” nouns (i.e. regularly inflected) category because they used the -an suffix. The A.-Sax. plural suffixes -u and -an are not active any more, victims of continued creolization. The process is not over yet, the plural marker -s has been extended to singulars in the old collective sense formerly modified with the suffix -an: babes, sweets. Both Türkic and English did not use plural markers if plurality was conveyed by other means: 3 sheep, 6 o'clock, 2-pound note, 7-year period. The continued creolization tends to add plural marker -s to these plurals: 3 sheeps, 6 o'clocks, 2-pounds note, 7-years period. See -an, -'s “possessive”.

English -'s “possessive case suffix” ~ Türkic -si “possessive case suffix”. Both Türkic and English suffixes indicate belonging of an object or subject to a 3rd person singular: Türkic ma:masi, mamüsi, annesi “mother's”; ata, dedä, baba “father's”. Cognates: A.-Sax. –es: fæder (sg.),  fæderes (gen. sg.) “father’s”, windel (sg.), windles (gen. sg.) “basket’s”. The codified suffix -'s is a contraction of A.-Sax. -es < Tr. –si, the apostrophe “’” is a conventional scribal for -e- unrelated to the verbal form. Other A.-Sax. suffixes -e, -re, -an (gen.), -a, -ra, -na (pl.) etc. have vanished. It is peculiar that the -'s/-es /-si has survived while the others have vanished, a similar possessive marker may have been used by the prior local populations. See -s/-es (suffix pl.)

English to (prep.) “verbal marker of infinitive case” (Sw N/A, F3, 3.12%) ~ Türkic tur-/dur-, Chuv. tu- (v.), a truncated form, “do, act”. The CT polysemantic verb tur-/dur- conveys notions of “intent or readiness to act, duration or permanence of actions”, among other derivative notions; grammatical function verb and auxiliary verb of action, functionally eidetic to the Eng. pair to (infinitive case) and do. Cognates: Dan. at, Du. te, Norse a, Sw. att, Gmn. zu. Essentially, to is an allophonic form of the verb do, the compound form to do is tautologic, repeating the verb do twice, in different phonetic forms. The stem tö-/tü- serves in numerous Türkic derivatives expressing semantics of “make, made”: tükät “completeness, completion of action”, törü- “happen to occur, emerge, be born, appear, give birth”, törüt “create”, törči “happen, occur, undertake, initiate”, törči also serves as auxiliary verb exactly like the English do, with a similar complement of functions: “make, engage, carry out, carry on, get done, proceed, cause to happen, engage in, comport, execute, finish, complete action” with idioms and nuances. In the compound to do the part do “do, act” originated within the western Türkic Sprachbund vernaculars as an allophone of the verb tö-/tü-, ascending at least to the 1st millennium BC, while the part to “do, act” was a later accommodation to the typology of the IE languages, replacing the agglutinated verbal suffixes that made clear the verbal use of the stem: to act, to pack, and the like, versus nominal use of act, pack, and the like, lit. do act, do pack. In its syntactic function, to parallels the function of the noun articles, it is a determiner that indicates the functional specificity of reference. The IE etymology conflated the directional particle to “in, into, towards” with the verbal determinant to, assuming that in spite of drastically different function the phonetic likeness indicates a common origin. That speculation on Eng. peculiar independent semantic shift conflicts with the presence of cognates in other Gmc. languages. Instead of recycling a functionally misleading directional marker to to denote a verbal function, English seamlessly adopted as a verbal marker a verb to meaning “do”. Chuv. is held as a language of the Hunnic circle, that allows to time the introduction of the alternate form tu- for the verb do to the 4th c. AD or Sarmat expansion in the 2nd c. BC. The frequency value is somewhat exaggerated because the source listing does not discriminate homophonic adj.s and ratios, but impacts of those contaminants are negligible. See and, at, be, do, make, in, on 1, on 2, till (prep.).

English un- (prefix) “negation” ~ Türkic -an (àŋ, ang) (suffix) “negation” (MK I 40). With the ancient English language speakers switching to the creole morphology of the new language(s), the old negation suffix moved to become a prefix, and the original negation suffix -an (àŋ, ang) had vanished as a suffix. The set of negations an and ma (the Fr. negation pas /pa/ in ne... pas may be an emphatic tautological allophone of ma) appears to have a “Nostratic” primordial pedigree. They appear as prefixes and suffixes depending on the typology of the languages, and include uncounted allophones and transpositions. See me, my, no.

3. Verbs (226)

English access (v., n.) “entry, opening” ~ Türkic ačsa:- (achsa:-, v., desiderative), ačıš- (achısh-, v., co-operative) “open”. Ultimately a derivative fr. the verb ač- (ach-) “open”. The notion “open” forms a first semantic cluster out of total 16 clusters. Its elucidation lists 39 meanings, from “open” to “sharpen a pencil”. The other 15 clusters are much more modest, from 3-4 to a single meaning, spread across dozens of Türkic languages. Among those are notion #5 “conquer, capture” and #10 “start (war, process, discussion)” reflected in the Fr. semantics (Sevortyan 1974 vol. 1 p. 209). The convergence of the Lat., Fr., and Eng. versions in distinct semantic and phonetic aspects indelibly point to a single Türkic origin. English, like Türkic, has numerous semantic meanings, few literal and many metaphorical, at times unexpected (e.g. open door and open season, open sky, gift “open the door with a gift”, open mind “non-prejudiced”, open-ended “changeable”, etc.). The transition fr. ach- to ac- (ak-) is regular, in A.-Sax. -c- indiscriminately stands both for -k- and -ch- (e.g. tacan (tachan) ~ teach). Cognates: OFr. acces “onslaught, attack; onset”, Lat. accessus “entrance, coming to, approach”; Finn-Ugrian az “opening, hole, pit”, Vogul as “hole, pit”; Mong. asa “split”, Kalmuk ats “split point between brunches”, ditto Kaz. aza; Manchu asa “join, converge”, asap “conjoining (joining, conversion point of mountain ranges)”. Distribution is not limited to a spec of W. Europe, it extends from the Atlantic to the Pacific, way beyond the myopic sight. It crosses linguistic barriers, including those with the European Romance branch. On semantic side, “IE” cognates demonstrate a disconnect that can be only explained via a source that carries both an “entry, opening” and an “attack”, a scenario that ascends to the Türkic verbal stem ač- (ach-) “open”. On phonetic side, the IE etymology fictionalized a Lat. compound ad- “to” + cedere “go, move” with a transformation from ad- to ac- (ak-) and “entrance” to “move”. It conflated that with the Fr. separate semantic “onslaught” distinct from the Eng. concise “entry, opening”. That is a typical etymological technique of creating brand new IE proto-words on a fly. Unwittingly, that machination leads directly to the Türkic ač- that statically makes impossible an existence of an analogous word in any other earthly language if not in the entire universe within a radius of 10,000 light-years. It has to be either a loanword or an imposter. The IE claim of an open road to nowhere prevents a promising ethnolinguistic research of the available material, like an association of certain semantic clusters with certain languages. That, in turn, is a potent diagnostic tool to trace the origins of the English language and contributing routs, a sole task of the etymology.

English act (v., n.) “doing or done” (Sw N/A, F735, 0.03%) ~ Türkic aqtar- (v., causative) “raid, dunk, plunge” akıt- “raid” Ultimately fr. the verb aq-/ak- with a prime notion “(river) flow, stream, jet”, a fistful of secondary notions (“liquefy, pour, bleed”), and a wealth of few direct and a wealth of metaphoric derivatives, one of which is verbal “attack, assault, raid” (also “carried, float”, connected with the notion “move, stream”). The initial origin from “(water) movement” and “cattle rolling over” appear to be sound interpretations. Like the Eng. “flow”, aq-/aq grammatically comes in noun-verbal homomorphic pairs, attesting to its most archaic pedigree. On the surface, without minute analysis, numerous derivatives of aq-/aq appear to be homophones unrelated to each other. The others, incongruent semantically, are held as true homophones, e.g. “white”, “net”, “elder, aged”, “rake (glance)”, etc. In addition, even for the same word, Türkic has numerous phonetic and rendering variations ağtar-/axtar-/aktar-/akdar-/agdar-/agnar-/agar-/ağna:- etc. It has numerous concrete extended meanings “overthrow”, “turn or roll over, “knock over”, “lay low something”, “examine”, “search”, “translate”, some quite loaded, like “sudden raid at night”, fairly in line with the spectrum of Eng. 200+ applicable meanings. The inlaut alternation -ğ-/-k-/-q-/-x- allows semantic shifts and breeds dialectal diversity. The loss of the final element -er shows that the internalized root was perceived as aqt. In English, a derivative -act grew into a powerful suffix: contract, impact, react, transact, etc., with a slew of their verbal and object derivatives. In one or another form, the word gained universal acceptance in most languages of the modern world. Cognates: A.-Sax. agan, aegan, agangan “go, go by, pass”, ONorse aka “to drive”; MIr. ag “battle” (Türkic “raid, attack”); Lat. ago “put in motion”, OFr. acte “document” etc. obviously of derivational origin; Skt. ajati “drives”, ajirah “moving, active”. Distribution is like needles in a haystack: too many meanings, too many possible languages. The IE etymology stops at Lat: Lat. actus “a doing, a driving, impulse”, pp. of agere. Its circular logics and an unattested faux PIE stem *ag- were mechanically deduced from various derivatives in Lat., Gk., and Skt. The Skt. examples mingle roots and suffixes, Cf. -ti, Türkic 3rd pers. suffix; the root is aja/aji. The Lat. act- likely appeared as a reflection of the semantics “move”or “raid”, an echo of the MIr. “battle”.

English aggrieve (v.) “feel bad”, aggravate (v.) “make worse” (v.) ~ Türkic aɣrï-, aɣïr-, aɣru- (v.) “feel bad, be sick, make sick”, “grow heavy, become a burden”. Ultimately a derivative of a verb a:ŋ- “lean to one side, outweigh (on one side), hang down”. Grammatically aɣrï-/aɣrï- is a noun-verbal homomorphic pair, attesting to its primeval origin unrelated to the consonant gür (n.) “grave”, see grave. The Türkic stem is most productive, applied with anything unpleasant, and in that respect closely parallels the English “sick” and “grieve, onerous” respectively. The Türkic linguistic nest has aɣrïɣ “pain, sickness”, aɣrïɣlïɣ “sick, painful, suffering from disease”, aɣrïɣučï “suffering person”, aɣrïmaqlïɣ “painful”, aɣrïn “suffer pain”, aɣruq “load, weight”, “seriously ill”, and so on. In the A.-Sax., with retained first vowel, it wielded a notion of beginning or transition form a lighter state to a heavier state by interpreting the anlaut a- as an unemphatic verbal prefix marking verb as a momentary single event, Cf. A.-Sax. abakan “bake”, aberan “bear”. With aphesis of the unaccented first vowel, the root ɣrï-/ɣru- produced a bifurcated host of grievous derivatives in the European languages: grave (adj.), grief (~aggrieve), grim, grimace, grime, gravity, gravitate, gravel, and all their derivatives with semantic meaning of “(pain, trouble, heavy) burden”. Cognates: A.-Sax. egile, egle “grievous, painful”, ONorse uggr “dread”, Goth. aggwifa “anguish, distress”, aglus “hard, difficult”, Gmn. verärgert lit. “real grief”; Lat. aggravatus, aggravare “make worse”; Sl. gore (ãîðå) “anguish”. Distribution is ubiquitous across NE Eurasia, spotty and vague in Europe. In Europe a bulk of cognates are Gmc., pointing to two independent etymological paths, one via Gk.-Lat., and one Gmc. Even the latest derivatives, like the modern noun “gravity”, echo the proto-root “pain, burden”. The IE etymologies are few and far between for the huge raster of the derivatives, with imaginative “PIE reconstructions” for some derivative forms, without any cited actual cognates in the Asian IE languages. See grave.

English aim (v.) “direct, target”, (n.) “target” ~ Türkic amač (n.) “aim, target”. The word amač is formed with the noun am and a diminutive denoun noun suffix -ač/-äč/-uč/-üč, lit. “small pussy”, quite appropriate and not unexpected title for the exercise. Türkic derivatives have suffixes stripped by English grammar: amačla “to target, aim”, amačliq “target range”, etc. Cognates: None IE. The native A.-Sax. word was targa, targe and was a term for a small shield used for practice. It survived to the present, not supplanted by the newcomer aim. The suggested IE cognates are phonetically dubious and semantically unrelated: “estimate”, “value, rate; count”, “plow”, etc. They conflict with simple and vitally important training requirements of the past millenniums. The absence of the IE cognates indicates direct continuity between the Türkic and English. That suggests that the OFr. word came fr. the military lingo of the European Türkic tribes (Cf. Burgunds), while the Pers. word hadaf for “aim, target” came from the Semitic Mesopotamian sources. The suggested Pers. word for “plow” as a loanword for the Eng. “aim” does not make any sense: the nomadic pastoralists and hunters had to aim millenniums before the Persian cradle of the Corded Ware times. The winded wordy rationalization used to cross from the “plow” to “aim” shows a futile linguistic exercise based on some homophony. The phonetics is direct, and semantics perfect. Rather, the Türkic source is likely a metaphoric use of the noun am “vulva, pudenda muliebria”, quite appropriate for its time, for near exclusively male participation, and for non-offensive nature of the word in the world where sexual reproduction of the herds was a daily affair guided and assisted by people.

English ambush (v., n.) “lying in wait, attack by surprise” ~ Türkic buš-/bus-/baš-/bas- and puš-/pus-/paš-/pas- (v.) “push, wound”, busuɣ, pusuɣ (n.) “ambush”. The base meaning of the verb buš-/puš- with all its allophones is “push, press”, with innumerable derivatives, of which the “sudden attack” is philologically just one of many, but in the Türkic warfare a cardinal term for a tactics that over the centuries overpowered nearly all states from Pacific to Atlantic, it was invariably attested for the armies from the Scythians to the late Türks. Cognates: A.-Sax. basnian, OFr. embuscher. The late Fr. word is apparently an accommodation of a Burgund form with then Fr. prefix em-, the original busher with the directional suffix -er/-r conveys the notion of “into an attack”. The Eng. form conflated with the Fr. form and replaced the A.-Sax. native noun basnian “wait (in ambush)” and the corresponding verb of the same root. That the Eng. and Fr. words come via separate paths fr. separate vernaculars is attested by the A.-Sax. absence of the prefix and phonetic differences of the same root, the bush- vs. bas-. The IE etymology is misleading and plain silly, fails to connect the dots, it ignores or does not know the native basnian, and suggests a juicy Fr. version of some act “in the bushes”. See push.

English anger (v.) “feeling toward some grievance, vex” ~ Türkic özak (adj.) “narrow”. The attested link is Türkic özak “narrow” > Goth. aggwus “narrow” > A.-Sax. enge “narrow, painful” > anger (v.) “to irritate, annoy, provoke”. The semantic shift happened in front of our eyes, otherwise no etymologist would ever come up with such unruly phonetic transformation and disconnected semantic transformation. The Türkic özak (adj.) is a derivative of öz (n.) “valley, pass between mountains”, hence a narrow passage, narrows. The semantic of “narrow” is also preserved in MDu. enghe, Lith. ankshtas, Lat. angustus, Sl. uzkii, vuzkii, Arm. anjuk, Skt. aihus, aihas, Av. azah- “need (i.e. tight circumstances)”. Before undergoing semantic innovation in English, the word spread with its original meaning of the Türkic öz “narrow pass” to the South-Central Asia at about 1600 BC, and with the Kurgan or Sarmatian waves to the NE Europe and SC Europe (Lat. angustus “narrow, tight”). The syllable öz comes in numerous flavors, öd, öδ, öz, üz, making the Goth. form aggwus and Sl. uzkii, vuzkii just another attested dialectic forms, clearly of separate linguistic branches. The IE etymology does not dig to the base stem of the IE forms, stopping at a limited sampling of allophonic forms. The Lat. derivative angustia “narrowness, tightness, straightness” of the Türkic özak (adj.) produced the English anguish, another allophone of the A.-Sax. enge. See anguish, narrow.

English are (v.) present plural indicative of be “to be” (Sw N/A, F36, 1.20%) ~ Türkic er- (v.) “be, am, are, is, exist”, conjunctional, noted accompanied by a predicate; it cannot be used by itself to mean “to exist”, but is often used as an auxiliary verb after participles, etc. of other verbs (Clauson, p. 193), and as deverbal and denominal suffix -ar/-er (with phonetic variations) with meaning “be, to be”, forming nouns and adjectives of the type eye + be > “ocular, visible” (körünü:r, fr. kör (n.) “eye”), see + be > “ocular, visible” (okunur, fr. oki:- (v.) “see”, see eye). In Türkic, the suffix forms -ar-/-är-/-ur-/-ür-/-ir-/-ïr- produce derivatives of the type X-be, still preserved in English in the forms “that be” (e.g. “powers that be”), “we be”, “us be”, “blessed be”, “boys be”, and the like. The Türkic applications, in addition to the above, are tutar “caught be/be caught” < fr. tut- “catch”, kelir “come be/bring/convey” ~ came < fr. kel- “to come”, ketär “depart be/take away” ~ remove < fr. ket- “leave, depart”. Cognates: A.-Sax. ear(un) (the form ear- (ər-) may reflect dialectal variation of the vowel), ar(on), Mercian ear(un), Northumbrian ar(on), Norse, Dan. er-, Sw. är; Icl. er(um), Welsh ry(dym); Tr. er(tim). The Türkic cognate is cited to illuminate the relict case of paradigmatic transfer of the Türkic 1st pers. marker -m encountered in the above Celtic words. According to the standing IE etymology, the are supposedly has no IE cognates whatsoever, but cf. Sp. yo era, tú eres, eras, él/ella/Ud. era, nosotros éramos, vosotros erais, ellos/ellas/Uds. eran. The Lat. suffixes mimic Türkic suffix -ar: -eram (with suffix -m), -ero. The genetic connections between all these examples are indisputable. The European (and Türkic) languages have three main stems for “be”, er-, es-, ol-:, and be-: Balto-Sl. (Latv.) esam (with suffix -m), (Lith.) esame (with suffix -m), (Sl.) est (åñòü); Est. oleme (with suffix -m), Fin. olemme (with suffix -m), these Uralic forms are visibly acquired versions ascending to the Türkic er-; Gmn. bist, Fris. binne, Az., Turk., Kaz. biz, Mong. bid, these forms ascend to the Türkic bol- (v.) “be”, see be. The origin of are is supposed to be a puzzle, a screaming enigma in the IE paradigm. It does not fit any scholastic IE schemes. It is an irregular, demonstratively non-IE verb with all verbose conjectures befitting a great scientific conundrum. The origin, however, lays on the surface: a Türkic form of the verb “to be” or a matching agglutinative suffix that produced derivatives expressing an existence (being, presence, fact) of a property, action, or a trait, in flexive surroundings used either as a stand-alone application or in a form of agglutinated suffix. The origin reliably points to the substrate language with an incompatible grammatical structure. Apparently, in A.-Sax. be and are were used interchangeably, depending on the analytical semantics of general vs. singular. Notably, both Türkic and English have salient identical bifurcated semantics of the stem ar/är (ər)/er. In Türkic it stands for “be” and “man”, in English it stands for “be” and “man” in the form of instrumental suffix (teacher, dealer, etc.); such a transfer of bifurcated semantics is a case of paradigmatic transfer indelibly attesting to the common genetic origin. The distribution of the verb are is peculiar, the Lat. form corroborates the pra-Celtic influence on the pra-Italics, the northern European cognates and the Türkic cognates are consistent with the spread of the Kurgan's R1b haplogroup. The spread of the bifurcated “be” leg is incompatibly more directional and focused than the dispersed spread of the “man” leg, and all seeded languages are adjacent to the Eurasian Steppe belt. See -ar, be, make.

English argue (v.) “quarrel”, “proving or refuting” ~ Türkic arqu-, arqula-, arɣula- (v.) “discord, disagreement, strife”. Ultimately fr. the polysemantic verb ar- “separate, part” for a piece that bridges two parts, a:ra (n.) “middle, center”, its deverbal noun derivative arqu: “ravine (valley)”, and finally its denoun verbal derivative arqula- “sow discord”, lit. “cause ravine” between two sides. The semantics of the root ar- is “in between”, like an arch connecting two sides, with “intermediary” bifurcating for positive and negative derivative notions, in this specific case a rather negative connection. The suffix -la makes an instrumental denoun verb. The verb ar- has a version az-/as- of the r/s rhotacism, attesting to a deep age of the stem. It also has forms with prosthetic anlaut consonant ha:r-, ha:y-, xa:r-, and anlaut vowel versions u-/ï-. Where the EDT sees mostly a positive, like a “messenger”, the OTD notes a homophonic negative. By now it is an international word via the scholarly argument. Cognates: A.-Sax. arteflan “separate, divide”, OFr. arguer “maintain an opinion or view, harry, reproach, accuse, blame”; Lat. argutare “blabber” (negative), arguere “make clear, make known, prove, declare, demonstrate” (positive); Hu. ervel “argue”; Mong. aračila “plead for somebody”, marga(dar) “argue”; Hittite arkuuae- “plead”; Kor. phari ha “emaciated, gaunt”; Finno-Ugrian languages also have correspondences with the stem ar- (Nemeth, 1928). The Mong. form has a prosthetic anlaut m-. The significance of the Kor. form is that it is an alternate meaning of the same verbal stem ar- in Kor. articulation. No IE etymology, the suggested faux “PIE proto-words” *argu-yo-, *arg- “shine; white” are semantically unrelated and unsustainable. The cognate distribution crosses linguistic barriers and is spread across Eurasia from Atlantic to Pacific, a feat patently unreachable by the Lat. It also involves Hittite that is far older than the Lat. and is not related to it. The Türkic etymology reaches down to the prime stem, is semantically and phonetically perfect, and is corroborated morphologically by correct morphological elements, the suffixes -qu and -la. See arch.

English assign (v.) “allot, arrogate, ascribe, delegate, portion” ~ Türkic asïɣ (n.) “benefit, interest (on a loan), gain, profit, advantage”. Ultimately a deverbal noun derivative fr. a root as- “hang”, its semantic extension “stick, stuck”, and its verbal form asït- “profit”. The Türkic notion is “benefit” in a form of interest, gain, profit, or not necessarily material advantage (like advantage of knowledge). The European core notion is “assignment of benefit” in the same forms (interest, gain, profit). Usually, but not necessarily, it is in a form of a legal document, best exemplified by the term assignation “grant a right (for benefit)”, related to transference of property or a paper obligation to receive principal and/or interest. A shift from nominal to verbal predicate could be a western innovation, the eastern records cite only a noun form. Cognates: A.-Sax. asegendnes “offering”, est “bounty”, OFr. assiginer (v.) “assign, appoint, allot”, Lat. assignare (v.) “assign, allot, award”; Welsh aseinio “assign”; Fin. ase “get stuck”; Mong. asa “stuck, adhere”; Manchu hasi “hang”, a precursor of “stuck” a precursor of “gain”. A complimentary term “allot” also has Türkic cognates: Tr. atamak, Azeri etmək. The base forms assignation, assignatio and assignacion demonstrate a fully articulated -g (-ɣ/-ğ in Türkic case). Both the Türkic semantics and phonetics are perfect. A.-Sax. internalized a fossilized third-generation derivative in its trade semantics. The IE etymology offers a phonetically dubious Lat. compound of ad- + signare (signum “mark, sign” > adsignare > assign). The chances that precisely the same very specialized trade word would arise independently are nil because the semantic field width is extremely narrow (“getting a benefit”). The IE etymology would imply that most remote Siberian tribes had adopted a Lat. compound invention that also managed to successfully escape nearly all IE languages. The Türkic etymology has an advantage of being direct, simple, and eidetic phonetically without a need for a chain of phonetic transformations. Apparently, English has received this Turkism via French independently and without conflation of its inherited A.-Sax. cognates. See attach, assessment.

English attach (v.) “bind” ~ Türkic atka:-, atkan- (v.) “attach, stick, cling”, a denoun verbal derivative of atka:ğ (n.) “barb, burr”. The ultimate origin is of at “horse” in adjectival form with a directional suffix -ka, lit. “(stuck) to horse”. Any agglutinative language naturally forms derivatives of “horse” (here, at) semantically related to grass, burr, and the like (Cf. Sl. generic loshadinyi (ëîøàäèíûé) “horse's” with the Türkic suffix -in/-an “of the horse”). The connection is to burred grass seeds atka:ğ that cling and hurt herds. Cognates: A.-Sax. aetcilfian, aetcilfian “adhere”, aeteaca “appendix”, aetfele “adhesion”, OFr. atachier (11c.), It. attaccare. No IE connection whatsoever, no trace in different IE branches. A suggested “PG proto-word”, Frankish *stakon “post, stake”, is beyond absurd. The palatalization -k- > -ch- is a Sprachbund trait of the languages in the western steppe zone, the It. vs. OFr. examples attest to normal dialectal variations and co-existence of palatalized and post-palatal velar forms. The “late” appearance of the documented word parallels a host of other English words that the grammarian strata did not have any attachment to. A.-Sax. had an impressive trail of derivatives, attesting to a long history of the internalized word within the A.-Sax. milieu. Some of these words are compounds of two Türkic words: aetfeolan “stick, adhere, continue” < atka:- “attach” + bilin- “feel, like” > “sticky, gluey” (see feel), aetfele, aetfeolan “adhesion, adhere” (with semantic extensions “continue, practice”). Others are a blend of the Türkic atka:- and a European word: aetgaedre “united, together”, aethabban “retain”, etc. In compounds the second consonant -k- is routinely elided. The form aetheaca attests to a divergent pronunciation of the sound -t-. Such internalization, amalgamation, and divergence point to a millennium-long process, which potentially brings the Anglo-Saxon union back to the middle of the 1st mill. BC, and internalization to the Corded Ware period. See feel.

English augur (v.) “predict” ~ Türkic ay- (v.) “tell, talk, say, explain, nterpret”, ayi (n.) “sound”, ayığ (n.) “word, speech, command”. The base notion is “voice, vocalize”, hence it is used for emphatic expressions like predictions, praise, and accusations. In the earliest period ay- to some extent was a honorific “declare”. The Türkic 3rd person form is ayur/aygur/ajar, aygu (ayguc) is “advise, explain”, with an instrumental sufix -či (-chi) it is ayguchi (ayguchi) “teacher, adviser, councilor”. Gmc. languages routinely articulate it with a prosthetic w- or its equivalents, and with negative overtones. Cognates: A.-Sax. weman (v.) “voice, announce, be heard”; Lat. ai(i)î “declare, assert”, augur “religious official” foretelling events, auguro “prediction”; Goth. waya (waja) “voicing, expression (blaspheme, slander)”; LAv. ad- “tell, say”; Mong. àyà, ayas “sound, sounds (especially of speech, pronunciation, accent, rhythm, tone”. Distribution extends from Atlantic and Mediterranean to the Far East ends, crossing linguistic barriers and lands which a Latin sole never trod. The Türkic aiguchi'es “adviser, councilor, chancellor” accompanied all Türkic epical and real monarchs. Tonyukuk of the Orkhon runic Inscriptions was an Aiguchi. The dumbfounded IE etymology stops at a single Lat. instance, citing as cognates a few mechanically applied nonsensical phonetic examples (avis “bird”, garrire “talk”, augos “increase”). Semantic and phonetic match is perfect. In a feat of paradigmatic transfer, Eng. possesses five main action words related to communication: declare, say, tell, call, and gabble, the direct siblings of ay-, söy-/söyle-, til-/tili-, qol-, and gap-/gapir-. Although overlapping and interchangeable to some degree, each one conveys its own spectrum of very basic communicative notions. The origin from the Türkic milieu is indelible. See augur, call, gabble, say, tale, talk, tally.

English awe (v., n.) “to be inspired, deep fear, reverence” (Sw N/A, F1140, 0.01%) ~ Türkic ö, ö-, a-, u- (v.) “think, reflect, delve, understand”, with a: (’a) carrying a flavor of “astonish, amaze, astound”.; u- is an allophone of ö-/a-. A single-phoneme allophonic line produces overlapping derivatives with diverging outcomes, e.g. adar- (-dïr-, -δïr-, -tïr- -ŋa, -ŋla-), îdɣur- (-δɣar-, -δɣur-, -jɣur-), etc. A use of non-rounded initial vowels extends the allophonic derivational field further. The extent of modifications involves nearly entire store of consonants: öb(kälä-) “angry”, öč “revenge, anger”, öδ(ik) “passion”, öf(kä) “anger”, ög, ök “intellect, thought” with the standard Türkic deverbal noun suffix -g/-ɣ, and so on. The words ay “oh, ouch, ow, ouh, aah, uh-oh, ugh”, añığ “excessive(ly)”, ayman “fear, timid, faint”., etc., appear to be the derivatives of a:-, ö-, and u- expressing a negative excitement of awe. Of those, the ayman, with a relict deverbal noun suffix -ma, portends the awe of fear predominant in the Middle Age A.-Sax. lexicon. The Türkic noun derivative ög, ök formed with the standard Türkic deverbal noun suffix -g/-ɣ became the A.-Sax. eg, ege and its Gmc. allophones. The A.-Sax. blurry transcription eg-, aeg-, eag-, ieg- simultaneously attests to a wide spread of the word, attempts to render the rounded ö- with the tools of the Roman alphabet, and an accuracy of the spelling awe to render the rounded ö-. The final victory of the form awe (ö-) over the A.-Sax. phonetic variety attests to the demographic significance of the original Türkic component, probably numerously reinforced and reinvigorated by continental refugees fleeing Middle Age religious persecutions like that of the Cathars. Both Eng. and Türkic have preserved the noun derivative ög/ök “mind, thought, understand”. Cognates: A.-Sax. aege, ege, oga “awe, fear, terror, dread”, aghe, ege “fear”, egesa “awe, fear, horror, peril”, ONorse agi “fright”, OHG agiso “fright, terror”, Goth. agis “fear, anguish”; Ir. ua(mhnach) “awesome”, iontas “astonishment”, Gael. obha “awe”, iong(nadh) “astonishment”, Welsh ofn “fear”, anhy(goel) “astonishment”; Latv. aizplust “astonishment”; Sl. ah, ahnut (àõ, àõíóòü) “astonishment, fear”; Gk. akh(os) αχ(ος) “pain, grief”. The early literary examples apparently are heavy on religious admiration and fright of God, while the “awe” is a more generic awareness than a particular cause like a fear. The simple modern “awesome” has nothing to do with fear, the “awesome” are kittens, shoes, and manners, anything worthy of admiration and inspiration. The standing IE etymology ignores mental inspiration and explores its fear utility, connecting awe with Gk. akhos αχος “pain, grief, ally”, which does not make sense in respect to kittens and shoes, although its semantics and phonetics make it a certain allophone of the Türkic ö- (v.) and ög (n.). The English spelling quite clearly attempts to render the labial ö in the ö- and ög with available means (the w is a labial voiceless phoneme), and the spellings with auslaut -g and Gk. -kh-/-χ- faithfully render the Türkic noun ög, i.e. ö-/ög “to think/a thought”. The A.-Sax. egesa faithfully renders the Türkic ögsa, a desiderative form expressing intensity, formed with the Türkic basic suffix -sa. The IE languages do not have single phoneme verbs and nouns other than some interjections and prepositions, any linguists in search for etymology should avail themselves of that fact unique for the western Eurasia. The numerous single phoneme words, like “think” and “eat”, probably ascend to the very beginning of the abstract human thought. English has inherited and preserved both words, a non-bigoted science would readily acknowledge and celebrate that. The word, preserved in its pristine phonetic form, in its verbal, substantive, and adjectival roles, with its abstract and concrete semantics, with preserved embedded Türkic suffixes, with a history of more than a millennium long, constitutes a fierce case of paradigmatic transfer for absolutely indisputable common genetic origin. It is still the same single phoneme word on the banks of the Thames or the Itil (Volga). Distribution of the word is consistent with migrations from the Eurasian Steppe belt and the Corded Ware expanses, with prongs emanating westward to the Baltics, Atlantic, and Mediterranean. The Celtic forms still echo the modern Eng. and CT forms, in spite of the parallel 12,000 years of independent existence and many alien influences after the Celtic circum-Mediterranean departure from the Eastern Europe sometime in the 5th-4th mill. BC. That attests to a common linguistic phylum ascending to the 6th mill. BC, and an existence there of a single-phoneme vocabulary. The autochthony of the word in the A.-Sax. is attested by a field of more than 30 derivational words and forms, a major impact for a limited abstract notion of “astonishment”. That number excludes the sibling words like the A.-Sax. aegleaw “law-knowing” from the ög “knowledge” and efen “awe, fear, horror, peril” from the ö- “think, reflect, delve, understand” that belong to the same base mental function. English has inherited derivatives awesome, awful, awing, awe-inspiring.

English bale (v., n.) “bound bundle” ~ Türkic bele- (v.) “bind, wrap, lace”. Cognates: A.-Sax. belə(can) “surround completely”, belecgan “to cover, surround”, belegde “covered”; OHG balla “ball”, Dan. baal, Du. bale, Norse balle, vikle; Sw. bal; Gmn. Ballen; OFr. bale “rolled-up bundle”; Welsh bels, byrnu; Sl. pelenat, pelenka, (ïåëåíàòü, ïåëåíêà) “wrap”; Hu. bala; Kor. beil 베일; Heb. byyl (בייל). Distribution of the allophones runs along Eurasian steppes. Cognates demonstrate amazing uniformity across linguistic families, attesting to widespread borrowing, and high mobility across Eurasia connected with active trade. The main spreaders of the word probably were commerce and women, with bales of goods and mamas swaddling babies. Numerous A.-Sax. derivatives and semantic extensions attest to the antiquity of the word in the incipient A.-Sax. language. The phonetic and semantic match makes the Türkic origin inescapable. See band, bazaar, bunch, bundle, belt, bucket.

English band (v., n.) “bind, tie, ring” (Sw N/A, F1524, 0.005%) ~ Türkic ba- (v.) “bind, tie, bound”. A reflexive-passive form of the verb ba- is ban-/ba:n- (v.) “bind, tied on itself, bound” and its m- allophone man- (v.). Cognates: A.-Sax. bindan, ONorse band “tie strip”, OHG binda, Goth. bandi, bandwa; MIr. bainna “bracelet”, Welsh band; OFr. bande, ONFr. bende, Fr. bander, Sp. bandana; Skt. bandhah; Taj. moftan; Mong. bind; Mongor. bo-; Kor. mukkda (묶다); Jap. baindo (バインド). The absence of Lat. cognate indicates particular north-west and south-east paths, the Skt. form dates the word to the migratory split time before 1500 BC. The OFr. form allows to suggest Burgund-Provence source, the ONFr. form points to Alans (Amorican Alans) of the 5th c. AD. The MIr. distinct semantics suggests 2800 BC circum-Mediterranean route if not an acquisition. The intransitive form ba- of the verb have not survived in the European languages, but the transitive form of the verb ban- not only survived together with its agglutinated Türkic suffix -n, but blossomed into uncounted derivatives in numerous languages, probably due to trading lingo in addition to amalgamation. The semantic and phonetic derivatives of the Türkic intransitive verbal stem ba- “bind, tie, bound” in English are extremely numerous, from band, bind, bound to military and social bands to the commodities like Band-Aid. The substantial Türkic-based English lexical cluster related to tying and packaging (bale, band, bind, etc.) and related derivatives (Cf. bandage, bonding, etc.) inextricably attest to their Türkic origin. See bag, bale, bundle.

English barge (v.) “break into, crash heavily into” ~ Türkic bar- (v.) “come, arrive”. A suffix -ɣ (-g) adds an aspect of a danger-warning and a 2nd pers, likely internalized as a part of the root with an air of calamity. The IE etymology on purely phonetical homophony confuses the verb barge “come” with semantically incompatible noun barge “small vessel” and its verbal derivative barge “transport by barge”, of a root barq- “construct”, see barge (n.). The semantics “sudden intervention” and “crash into” must be a semantic carryover or a local emphasis. Semantics may have conflated with the notion for the barge “transport”.

English bark (of dogs) (v., n.) ~ Türkic ür, Chuv. ver- (v.) “bark (of dogs)”. Cognates: A.-Sax. beorc (n.), ONorse berkja “bark”; Tr. barak “dog (type)”; Sl. synonyms laika (ëàéêà) “barking (dog)”, Sl. lai (ëàé) “bark”. The Chuvash form with a prosthetic anlaut v- (corresponding to English b-) is an allophone of the Türkic forms eerer, hur, örü, üjrek, üjürge, ür, üre, ürü, ürüü, they represent a relict that went westward to Atlantic. Other forms extend all the way to Pacific. The Tr. form barak attests that prosthetization occurred within the Türkic milieu, it attests an eastward cultural exchange. No IE etymology, a supple stipulation “of echoic origin” is as far from etymology as it gets. Chuvash is thought to belong to the Ogur branch, which dominated the Eastern and Central Europe for a millennia, from before the turn of the eras to beyond the 10th c. AD. The English dog bark with prosthetic anlaut consonant is consistent with other Oguric traces in Gmc. and English languages, along the lines of ür- > ver- > bar-. The form barak is formed with deverbal suffixes for noun/adjectives and abstract nouns, a common suffix -k (bark) and “rare” (in the eastern sources) suffix -ak(barak). The assertion of “echoic origin” may refer to the Türkic verb ür-, which very well may predate domestication of the wolfs. Eventually it developed derivatives for the notions of “bark” and “dog” using available grammatical tools. The recorded A.-Sax. forms bearca, bearcae, beorc point to difficulties in rendering the quality of the vowel using Roman alphabet.

English bash (v.) “hit hard”, (n.) “heavy blow” ~ Türkic ba:s, ba:š (ba:sh), ba:θ (n.), ba:sa, ba:ša- (ba:sha-) ba:θa (v.) “bash, whack, attack, assail, assault, wound”. The polysemantic verb's prime notion is “push, press from above”, along with nine other related clusters. The meaning “bash” comes from the notion “tramp, thresh, pound”. Cognates: A.-Sax. pote, potian “butt, push”, ONorse basca (basha-), beysta “strike”, Sw. basa “baste, whip, flog, lash”, Dan. baske “beat, strike, cudgel”; Ir. bruth, putaidh “push”, Gael. bruth, buille “blow, push”; Catalan festa “strike”; Galician bater “strike”; Rus. bastryk “yoke, weight, oppression”; Az. vurmaq “bash, shoot”; Mong. basu- “oppress”; Manchu basala “kick”; Kor. matta “break, crush”. No IE cognates outside NW European area on the general background of the entire Eurasian width. Like numerous other Turkisms, bash was probably lurking under the radar of the contemporary grammarians. A phonetic consonance between perfectly synonymous bash and bat allows to suggest that bash is an allophone of the bat, that both ascend to a primeval root ba-  (Sevortyan 1974 vol. 1 p. 89), see band. Then the attestation of this word runs to the Sumerian language of the 4th mill. BC (Sum. badd “beat, strike”). And the time of separation between the Türkic and English ascends to the time of the A.-Sax. Türkic-Gmc. amalgamation. The ONorse and Sw. had preserved the Türkic morphology in forming denoun verbs by suffixing -a. The perfect phonetic and semantic match, the authentic suffix, The global spread, and the status of the non-IE origin leave no doubts of the Türkic origin. The word belongs to the host of Turkisms that survived in English and Türkic without any substantive change. The entire body of intact survivors constitutes an insurmountable evidence of a massive paradigmatic transfer. See ambush, bat.

English bat (v.) “beat”, (n.) “bat,  club” ~ Türkic bad(ar) (v., n.) “beat, strike”. The Scythian pata “to strike, to kill” was explained by Herodotus IV.110 as a word for “kill” in the compound eorpata - those who are killing their husbands (with Türk. er “man, husband”, Cf. A.-Sax. wer “man, husband”). The Scythian eorpata in Türkic is erbadar, in A.-Sax. it would be werbatt/werbeot, clearly related reflexes. Cognates: A.-Sax. batt “bat, cudgel, club”, beot and (rarely) beoft “beat, strike, thrust, dash, hurt, injure, tramp, tread”, fystgebeat “blow, beat with fist”; OBreton bath “cudgel, club”, Ir., Gael. bat, bata “cudgel, staff”; OFr. batte “pestle”, Lat. battuere, LLat. battre “strike”; Sum. badd “beat, strike”. It is incompatible in Avesta, where pada is “heritage, offspring”. The Sumerian badd is the oldest record for bat (v., n.) ascending to the 4th mill. BC, it is construed as “to thresh (beat hard) with sledge (hammer)”. “To thresh” clearly stands for “to beat”, and “with sledge” clearly stands for “with bat”. This Sum. word aligns with the Türkic Bulgar folklore story of their descent from the Sumerians, linking Türkic with the Sumerian and Scythian, and then on to Gmc. > Eng. The Isfahan Codex in Yerevan with Hunnic grammar and wordlist from the 5th century AD gives a Hunnic batten as “push”, apparently with a minor semantic twist produced by ancient or modern Armenian translations, but still in the ballpark. Attested traces across Eurasia sequentially cover the Celtic circum-Mediterranean migrants from the E. Europe ca. 5th mill. BC, then the agglutinative “isolate” Sum. ca. 4th mill. BC, then the Scythian Kurganians from the Central Siberia ca. 3rd mill. BC, then the entire Eurasian agglutinative family of the Türkic Kurganians, and reach to our days. The appeal to the unattested illusive “PIE” clone *bhat- “strike” is worthless. Rather, the traces reflect an amazingly wide distribution long before the Corded Ware temporal period and footprint. Traces reflect a tight correlation with the attested spread of the R1b and the (“Hunnic”) R1a Y-DNA haplogroups. The particular three-phoneme word must have numerous homophones in nearly every language on the mother Earth, like the vat, but and butt in Eng., a huge playground for lively imagination and storytelling. Of a relevance to the etymology of the word are only the ones that directly convey the notion “beat, strike”. The Türkic - A.-Sax. - Eng. heritage is beyond any doubts. See battle, pat.

English bath (v., n.) “immerse in water” (Sw N/A, F1370, 0.01%) ~ Türkic bat (v.) “immerse in water”. Ultimately a derivative of the verb ba- “bind, tie, bound”, see band. A unifying generalized meaning of the verb bat- is “enter into”. Depending on special conditions of its manifestation, it is modified, becomes specific, and then determines the portable, by origin or use, meanings of the verb. The Türkic verb bat- comes in a range of forms and meanings. The stem forms are also put-, but-, pat-, bad-, and ban-, randomly scattered among different languages, predominantly in the form bat-. The bat- and ban- are acoustically different allophones, the verbal names for the same effect in related communities, Cf. bath and banya, q.v. The range of the meanings is astonishing, from “bath/bathe” to “bankrupt” (i.e. “(river) bank, tub wall + rupture” > “go under”), “poke, pierce, stab” (i.e. “reach under”), and to “dare” (i.e. “dive under”), altogether more than 50 meanings, although in each individual language the number is severely narrower, 5 “bath/bathe” in the modern English. The causative-transitive suffix -t- and causative-reflexive suffix -n- are nearly synonymous, the -n- adding an aspect of benefitting the speaker. The preservation or an absence thereof of the initial verb ba- in the daughter languages is irrelevant, and a range of its derivatives from various paths can peacefully co-exist in any language, Cf. band and bath. A homophonic verb büt-/püt- “come to an end, be finished” (Cf. bud, butt) may have further confused internalization, semantically conflating the process of moving “put, place” with the end of the process “have put, have placed”. Semantic difference between the two is on the level of mere shade. Cognates: A.-Sax. bað, bæð, baeð, baðu, bathu “bath, bathing”, also “water, etc., for bathing”, baezere “baptiser, baptist”, bedipan (v.) “dip, immerse”, pitt, pytt “well, hole, pit”, piða “inner, pith”, and so on; ONorse bað, botte “pail”, MDu. bat, Gmn. bad; Welsh baddon, maddon; Lat. batus, puteus “pustule, blister”, Sp. bañera with -n-, Fr. bain with -n-; Russ. banya (áàíÿ) with -n-, “bath”; Balt. (Latv.) vanna with -n-, (Lith.) vonia with -n-, Arm. baghnik (բաղնիք); Georg. abano (აბანო) with -n-; Mong. bann with -n-; Kor. baseu (바스); Jap. basu (バス); Heb. bath (בַּת) “archaic unit of volume”. See put for more cognates. Lexical and spatial distribution shows that of the 50+ semantic meanings only a small stream was carried to the northwestern Europe: a core meaning bath, plus derivatives put, well, poke. That lexical stream reached Europe very selectively also: a smatter of northwestern communities, plus spotty Lat. and Sl. cognates. But cognates extend eastward down to the Pacific. Distribution reliably excludes both “PIE” and “PG” origin. No suitable IE parallels, the IE etymological implication of the Lat. linguistic influence on the Anglo-Saxons, Sakha (Yakuts), Uigurs, Koreans, etc., and a faux “PIE” *bʰeh- “to warm” are absurdities waiting to be rescinded. Bathing was associated with hot water, especially with hot springs. The Somerset city in England, the A.-Sax. (OE) Baðun, called so for its hot springs, exhibits ancient Türkic-Celtic symbiosis. Another toponymic form is the Gmn. Baden. The paradigmatic transfer of the vernacular, phonetic, and semantic complexity indelibly attests to the origins in the Türkic milieu. A major advance in analyzing the word bat-/bat was done by E.V. Sevortyan, 1978, who brought together specks of unheeded knowledge scattered in numerous patchy works but not covered in etymological compendiums. We may feel bad for the throngs of the IE etymologists who toiled to compile a rational story. The true victims were the misled masses who trust patriotic disinformation. See band, put.

English battle (v., n.) “open clash” ~ Türkic pata “strike, kill”, see bat. An allophonic derivative ultimately cognate of the Sumerian, Scythian, and Türkic pata explained by Herodotus IV 110 as a Scythian word for “kill” in the compound eorpata - those who are killing their husbands (Türk. er “husband”). The word is incompatible in Avesta, where pada “heritage, offspring”. The (rare, i.e. archaic) suffix -l forms deverbal and denoun adj. batl that carries semantic extension of “brave, daring, decisive” etc. Another adj./noun is synonymous batyr widely known as a heroic title from the Hunnic times to the present, and across multiple linguistic barriers. Cognates: A.-Sax. beado- (prefix to denote battle, war, slaughter, cruelty): beadogrim “grim battle”, OFr. bataille; LLat. battualia, battuere; all derivatives of bat-. European languages have at least another 16 different synonyms, the form bat- is nearly exclusively a Romance phenomenon. Cognates for batyr: Arab. batal, battal “hero”; Pers. bahadur; LSkt. bahadurah, Av. baxtar; Osset. batyra, batra, paxar; Batraz (name); Alan Badur (name); Mong. ba’atur, bagatur, Sagay matır (b- > m-); Hu. bator, Bator (name); Sl., Rus. bogatyr (áîãàòûðü); all designate “strongman, hero, brave, warrior”. The deverbal suffix -ur/-ar and their allophones denote personal possession. The inlaut -g-/-ğ-/-h-/-x- are typical for Mong. suffixed articulation. The word is ultimately known from Sumerian, see bat. The Isfahan Codex in Yerevan with Hunnic grammar and wordlist from the 5th century AD gives a Hunnic batten “push” (with instrumental suffix -en), apparently with a semantic twist produced by ancient or modern Armenian translators, but still in the ballpark. The widespread origin from the Türkic phylum is incontestable. See bat, pat.

Be, bear – Preliminary Note. One of the most striking aspects of the IE family is the claim on sweeping correlations spanning entire conjugational paradigms observed across some, but utterly far from all, IE languages. The textbooks on IE linguistic theology uniformly cite tables illustrating the verb “to bear, to carry” which shows amazing lexical and morphological correspondences across Sanskrit, Greek, Latin, Gothic, Old Irish, etc., in most conjugational forms of the present tense, and in the lexical forms (Cf. Sanskrit bharami “I bear” (vs. Türkic bermi), Latin fero, Greek fero (φερω), Gothic baira, Old Irish beru “I bear” (vs. Türkic berü), etc.). Textbooks uniformly cite tables with verb “to be” showing similar lexical and morphological correlations across some, but utterly far from all, IE languages of the entire IE paradigm. The IE theory was founded on an underlying assumption that these patterns cannot be acquired and must be inherited. For example, Nichols (1996) says that these patterns are “individual-identifying evidence”, the evidence “obvious to the naked eye of a trained scholar” that the languages are genetically related. Alternatively, dissidents assert that because of the language change occurring during millennia, these patterns are too good to be the result of inheritance. One element the textbook tables are missing are the corresponding Türkic forms. The puzzle could be resolved on the spot: be, bear, dur- (Cf. endure), etc. were conveyed by the earliest Kurgan waves, and also acquired by various extracts from the central and western Europe during their millennium-long stay in the Eastern Europe. They were adapted and internalized by various people speaking various vernaculars. Some were internalized complete with the suffixes, Cf. Sl. brat, beru (áðàòü, áåðó) “to take, I take”, and Skt. bharami. The sweeping correlations spanning entire conjugational paradigms, spottily observed across some IE languages, form a theoretical base for justifying under Family Tree model a genetic origin of the whole IE family. The spotty paradigm is then used as an example of borrowability of the entire conjugational pattern between languages, Cf. the Greek-derived tense markers in Romani characterized by a wholesale import of the entire paradigm. Thus the elements of the archaic Türkic conjugational system become a proof of a viability of a wholesale import, and a pillar in the theoretical base for the entire IE paradigm.

English be (v.) “exist” (Sw N/A, F25, 0.73%) ~ Türkic bol-/mol-, buol- (v.) “be”. The word is extremely universal, at its base numbering 17 listed semantic clusters and 13 service functions. In addition to the b/m split, the form evolved b- > v- > 0 (bol, vol, ol), leading to the present dominating spread of ol along with numerous relict and dialectal forms. A diphthong -uo- is a dialectal form of a long vowel -o-. Cognates: A.-Sax. beon, beom, bion “be, exist, come to be, become, happen”, OHG bim “I am”, bist “thou art”, Gmn. bin, bist; OIr. bheith “be”, bi'u “I am”, Gael. bhith “be”, bha “was”, Welsh menjadi, berada; Lat. fui “I was”; Gk. phu- “become”; OCS. byti (áûòè) “be”; Balt. (Lith.) bu'ti “to be”, Rus. byt “to be”, etc.; Av. bu-; Pers. bu-, bulhös; Skt. bhavah “becoming”, bhavati “becomes, happens”; Fin. ole-, on “be”, Hu. vol-, van- “be”; Mong. forms and semantics are largely identical with Türkic forms, MMong. bol-, Khal. bol-, Buriat bolo-, Dag. bol-; Mogol bolu; Evenk bola-; Nenets bol-; Chinese bei 被 “be”. In Türkic: OT bol-, Chuv. pol-, Tat. bul-, Yak. buol-, Kirghiz bol-, Khak. pol-, Yugur pol-, Turkish bul-, dial. mol- “be, become”, Uz. mul-. The origin of the A.-Sax. forms is clear, they are bol + -än from a phonetic series -an/-än past tense participle suffix, bolan > A.-Sax. beon, see been. In this construction the specific form bul-/bol- is strictly conceptual, it could have been anything from the attested series bi-/by-/bu-/be-/bo-, with consonant variations b-/p-/w-/f-. Historically, all non-Türkic ( “IE”, Chinese, Mongolic and Tungus/Manchu) examples are contiguous with the Great Steppe, and either contain, or used to have contained sizable Türkic component. Distribution of the cognates is nearly ubiquitous. In linguistic terms, the word points to what used to be called a Nostratic origin. The peculiar m/b alternation present on the opposite ends of the Eurasian Türkic areal attests to its indigenous origin and deep roots. In English, some paired compounds with be seem to preserve intact, Cf. English “be abundant” ~ Türkic abadan bol “be (become) crowded, populous” > A.-Sax. beon, beom, bion abadan > be abundant. Another Türkic verbal stem for “being (v.)” var-/bar-/par- was preserved in Goth. and daughter languages, incl. English, as var and was, see was. Another Türkic verbal stem for “being (v.)”, with a notion of permanency, is dur-/tur- “be, do, stay, remain, stationary, halt, aux. verb “do”, copula”, which functionally overlaps with bol-/be to express “exist continuously”, it very successfully survived in the English do expressing abstract permanent action (e.g. “Do you like?”, “Do him in, do it now”, etc.). Welsh forms echo the very prime Türkic forms. The presence of the cognates in Celtic languages attests to its existence prior to the Celtic departure from the N. Pontic in the 4th mill. BC at the latest, the Indo-Aryan presence attests that it existed in the N. Pontic area before their departure ca. 2nd mill. BC. The Chinese presence ascends to the arrival of the nomadic pastoralists toward the end of the 3rd mill. BC (the legendary Xia dynasty) reemphasized ca.1750 BC with the arrival of the “Zhou Scythians” to the Shang China. Notably, phonetically Gmc. forms are much closer to the Türkic forms than their older Celtic siblings. The IE etymology is lost in utter confusion, calls it “fragment collection, conglomeration”, and still comes with an unattested faux “PIE root” *bheue- with true semantics of “be, exist”. The trio bol-, var-, and dur- constitute an authentic case of paradigmatic transfer attesting to traceable and veritable genetic connection from the Türkic Kurganic milieu. See Be, bear – Preliminary Note, been, do, durable, duration, duress, was.

English bear (v.) “carry” (Sw N/A, F1114, 0.01%) ~ Türkic be:r- (v.) “carry”. The Türkic be:r- (v.) is extremely polysemantic: “give, hand over, grant, bring, bring upon”; it lived on onto Eng. and other European languages as a paradigm, complete with its semantic wealth, with literal and figurative meanings, like “marry off”, “pay”. With the meaning of “give”, it is a part of an allophonic triplet give, bestow, and bear preserved in English as a paradigm with some modifications in semantic accents. The homophonous bear (animal) fr. Türkic bori makes its a foursome transfer paradigm, see bear (animal). It is Türkic most used auxiliary verb with participle, expressing direction for a second person and expressing perfect tense actions: “carried out conquest”, “carried out ordering”, “carry out sermonizing”, “carried out breaking something”, “bear praise”, “bear reward”, “bear title”, “bore a baby”. English continues to carry both the literal meaning (“bear fruit” ~“give fruit”) and the popular auxiliary function (“bear expenses” ~  “pay expenses”). Cognates: A.-Sax. ber(an), bær, bor(en) “bear, bring, bring forth, produce, endure, sustain, wear”, OSax. beran, OFris. bera, OHG beran, ONorse bera, Goth. bairan, Gmn. (ge)bär(en) “carry, bear, give (birth)”; Bulg. (Tr.) berü “give”; Sl. bremya (áðåìÿ), ber(emen)- áåð(åìåí)-) “load, carry (child, burden, bring)”, Bulg. (Sl.) bir- “give”. Not only the phonetics and semantics of the stem are identical to the Türkic word, but the grammatical function of the participle is accomplished with the retained Türkic suffix -an/-än that forms a past tense participle: Tr. beran “born”. The verb “carry, bear” shows amazing correspondences across Tr., Skt., Gk., Lat., Goth., OIr., etc., in lexical forms and in most forms of present tense declension (Cf. “I carry”: Tr. berim, Skt. bharami, Gk. fero (φερω), Lat. fero, Goth. baira, OIr. beru, etc. The Skt. agglutinated form is a clone of the Türkic agglutinated form. According to Kashgari, truncation is typical for the western Bulgar language, and Herodotus (4.117) wrote about Sarmatians in the same area who spoke Scythian incorrectly. Variations in spelling and slight phonetic modifications can't obscure the fact that this word survived from the archaic Türkic substrate into modernity with all its polysemantic cache and complete with the Türkic morphology. No need for unattested “PIE reconstructions” and unrealistic reverse engineering of a particular application. Distribution of cognates is generally limited to the Steppe Belt and its periphery: Gmc., Sl., and Türkic languages. The “birthing” triplet ber-, döl-, ken- is another paradigmatic evidence attesting to its Türkic roots, it is in a way a trade terminology passed from generation of “birthing” practitioners to their “birthing” daughters and granddaughters with the other “secrets of the trade”, and it can serve as unequaled linguistic marker inherited from generation to generation immune to all other societal turmoils. The homophony with the noun bear “large mammal” is predicated by the homophony of the Türkic stems of unrelated origin: ber- and böri. See Be, bear – Preliminary Note, bear (animal), bestow, confer, deliver, give, gene.

English been (v.) “past participle of be (exist)” (Sw N/A, F91, 0.21%) ~ Türkic ben “past of be (exist)”, see be for the root word. Cf. ičinte: beg ben “I've been (a) bek (prince)”, bašı: ben “(I) was (a) boss”. Ultimately a derivative fr. bol- (v.) “be, exist”. The development of been is clear, ben > A.-Sax. beon > Eng. been. The form bul-/bol- is strictly conceptual, it could have been anything from the attested series bi-/by-/bu-/be-/bo-, with consonant variations b-/p-/w-/f-. The original linguistic efficiency is demonstrated by the opposition ol- (bol- ) “to be, to exist” vs. ӧl- (bӧl- ) “not to be, not to exist”, i.e. die, death. Allophones of ӧl- (bӧl- ) are widely used in Gmc. daily and ritual languages, Cf. Walhalla “death-hall” with wal < ӧl- “death”. The known history of lexical transformations past the beon stage (13th c.) is well established, it is quite twisted, convoluted, unpredictable, and unsupportable by any “phonetic laws” of the 19th and 20th centuries. Quite the opposite, the rule is “there are no rules”, as is proper due to a stochastic process of amalgamation and adjustment. To overcome the conundrum, the IE theoreticians came up with a concept of a “b-root” with a flexibility of a jell, which can accommodate all known forms for the generic notion “be, exist”, and invented fictitious PIE and PG forms *bheue- and *biju- respectively. The Türkic bul-/bol-/ol- also nicely fits in the rubber scenario excellent in observations but hapless in explaining them. The presence of the Türkic, Celtic, Chinese, Mongolian, Avestan, Persian, Sanskrit etc. forms flatly defies the IE Family Tree model and irrational PIE and PG figments. Remarkably, the Türkic grammatic function of the past tense participle expressed with the suffix -an/-än has survived into the Gmc. bin, A.-Sax. beon, and modern English been alongside the Türkic OT, Kirghiz bolan/olan, Turkish, Tatar bulan, Khakass polan, and Uigur polan. Thus, a true linguist would not have problems locating cognates, seeing commonalities, and coming to sound non-prejudiced conclusions. The presence of the oldest Celtic form bi'u attests to the existence of that form in the N. Pontic at 6th-5th mill. BC, predating by far the word's migration with the Aryan farmers to the south-central Asia in the 2nd mill BC. See be; Be, bear – Preliminary Note; Walhalla; was.

English beg (v.) “plead for something” (Sw N/A, F1837, 0.004%) ~ Türkic bag/baɣ/baq (v.) “to look pleading, to plead”. English allophonic form with -g/-d alternation, with numerous derivatives: beggar, beg pardon, beg for mercy, “beg a question”. Cognates: A.-Sax. began, bugan “bow, bow down, bend, stoop”, bedecian “to beg”, OFr. begart (beggar). No sound etymology, no viable Gmc. connection, no IE parallels. A source could be an Alanian source of the Amorican Alans who moved into Brittany in the 5th c. AD, or of the Brits of Brittany with Sarmatian, or Scythian, or even Cimmerian via Frisian connections. In the English lexicon beg is known from ca.1200. In the Middle Age society, pleading with a lord must have been a daily affair, keeping the word rolling.

English bellow (v.) “sound of an animal” ~ Türkic belä- [belə-], bele:-, be:le:- (v.) “bleat, sound of a sheep”. English has variations bawl “cry loudly”, yowl “utter shrieks”, holla (b- <=> h) “sound of an animal”. Cognates: A.-Sax. bylgan, bylgean, bellan “to bellow”, hlowan, hlowung “lowing, bleating, bellowing” (b- <=> h), Du. blaten; Welsh beuo; Balt. (Lith.) bliuti/baubti; Sl. bleyat (áëåÿòü); Gk. velazo (βελάζω); Fin. mylviä (< myl/byl); European with b/m alternation: Lat. mugire, Port. mugir; Latv. baurot/maut; with b/h alternation: Fr. beugler/hurler; with l/h alternation: Basque behean; with b/br alternation: Da. brøle, Norw. brøle, Sw. vrala, Gmn. brüllen; Gael. berrar; It. barrito; Balt. (Latv.) baurot/maut. Cognates point to ubiquity of distribution, and various paths of acquisition. The IE etymology abstains from parallels, apparently suggesting echoic origin from numerous sources. The echoic origin apparently would be true and beyond our horizons. The faux “IE proto-word” *bhel- unwittingly is a replica of the attested Türkic belä- “to bellow”. The extant phonetic forms allow to diagnose much more specific sources. Notably, the A.-Sax. bylgan has preserved Türkic suffix -gan/-ɣan/-an (-än, -ın; -gän; -qan, -kän) that forms deverbal nouns: belä- (v.) > belägän (n.) > A.-Sax. bylgan (n.) “bellow”, indicating a temporally close transition from the Türkic to the A.-Sax. The homophonic bellows (n.) is probably related via woodwind flute. The Türkic origin is impeccable.

English bestow “grant, give” (v.) ~ Türkic bağıš-, baɣïš- (with silently articulated -ğ-, baish), bağıšla:- (baishla) (v.) “bestow, give, dedicate, confer, present, gift”, ultimately from bağ, baɣ “bag, bundle”, a derivative of the verb ba- “bind, tie, bound”, see band. Cognates: Pers. baxš “gift”, Mong. (Kalmyk) ba- “sign, mark”, Cf. Kipchak baiɣa, baigä, Chagatai bairaq “prize, award”. The Pers. word is idiosyncratic, a Türkic loanword not present in other IE branches. Distribution: majority of the Türkic languages, isolated presence in Eng. and Pers. The “IE” etymology suggests to parse supposedly A.-Sax. unattested bestow as be- + stow, lit. “emplace”, apparently modeled after A.-Sax. bigian, beagian “to crown”, where -gian is a version of the Türkic verbal suffix -gan (participle, instrumental), an allophone of the Türkic deverbal suffix -ɣan/-gän/-qan/-kän (noun instrumental outcome, archaic verbal participle): lit. “imbeing, (im)been(ed)”. Outwardly, that “IE, native” bestow looks quite suitable and phonetically precise, except that the construct be- + stow semantically creates a divergent paradigm that does not convey the notion of “grant, give”, the stow “emplace” is not exactly a “give”, and the Pers. cognate remains unexplainable. A root best can be played on to produce many more “IE etymologies”. The Türkic trifecta of terms for “give” comprises a hierarchy, qïv- (“bless, confer”), bağıš- (“bestow, grant”), ber- (“give, bring, bear”), with qïv- most dignified and ber- most mundane. English has preserved allophones of all three words, give, bestow, and bear, with some change in semantic accents (Cf. dignified bestow and mundane give, the bear is more “carry” and less “give”). This is a case of an entire paradigm borrowing, a clear attestation of the genetic inheritance, and an impeccable evidence on the linguistic scales. In the England's highly stratified class society, such an important term as bestow could not fail to appear in historical records way before the 15th c., that attests to a winding path for this word, and its survival in the folk lingo under a radar of the literary English. Notably, the artificial compound be- + stow unwittingly combines the Eng. allophone of the Türkic be- with the IE stow, following the seamless amalgamation mode frequently encountered in A.-Sax. lexicon and inherited by Eng. See be, bear, give.

English bet (v., n.) “wager” ~ Türkic büt- (v.) “be determined, confirmed; believe, trust” bütrüš- (v.) “wager, bet (between two sides)”. The derivatives of the verbal stem büt- neatly fall into the notion of contest for something to be confirmed, the truth to be determined: bütrüš- “contest, seek truth”, bütür- “confirm, find out, attest”, bütünlä- “seek truth, search for truth”. Türkic has 11+ words to express “wager”, vs. the Eng. two. That attests that gaming and competition were more than entertainment, they were a national pastime, including children. Cognates: Supposedly none, “argot of petty criminals, of unknown origin”; In reality: A.-Sax. bet “better, one who bets”; Dan. vædde, Norse vedde, Sw. (sla) vad, Gmn. vette, Icl. veðja; Fr. pari, Sp. apuesta, Rum. pariu, Portug., Catalan, Galician aposta, Lat. pateretur, pignus; Serb. opklada (îïêëàäà); Alb. bast; Basque apustu; Fin. veto, Est. panus, Hu. tet; Kaz., Uz. bet, Turk. bahis; Mong. bootsoo (áîîöîî) “bet”. In Türkic languages the alternation b-/v-/p- is regular, and some receptor languages also regularly palatalize initial b-. A prefix a-  (apuesta etc.) is also regular. Some random modifications at internalization should be expected. Some presumed cognates may not belong to the lineup at all. Still, a total absence of cognates is a gross distortion. IE etymology suggests a PG origin, i.e. either IE or by default non-IE, from nominally homophonic bait, A.-Sax. bat “food”, ONorse beita “food”, beit “pasture”. The semantic incongruity puts that speculation beyond contempt, from “bet” to “food”. In contrast, the Türkic etymology is semantically perfect, phonetically reasonably close. It does not need to appeal to the argot of the criminals, especially so considering that betting is a ubiquitous standing English tradition of all classes. The idiom I bet you illustrates the notion of bet “wager” in a sense of “I am sure (of this truth), I would wager (that this is truth)”. The A.-Sax. betrendan “to roll” may be a metaphorical derivative of bet- “roll a dice”. The homophonic better “betting person” and the native better comparative of “good” are not related. The Türkic origin stands on its own, supported by the lucid lineup of the cognates, known historical development, and unequivocal genetic tracings.

English bill (v.) “advertise, publicize” (Sw N/A, F899, 0.01%) ~ Türkic bil (v.) “find out, learn”. While Türkic, with its refined morphological mechanism for producing grammatical forms, tends to use active voice, English has a preference for passive voice verbs: “he is billed as an expert” means “he is said to be an expert”. Türkic billüg is “found out, known”, with extension to “famous”; “advertise, publicize” is formed with causative tense suffix -dur > bildur “notify, inform”. A derivative of bill (v.) is bill (n.) made quite famous with the Bill of Rights, followed up with thousands bills approved regularly by Congress. Each enterprise has a billing system, we get daily, weekly, monthly, and annual bills, we used to billet militia and army, we carry bills of different denomination in our wallets, we trace billables, are overbilled and underbilled. The “learned” in the form Bilge, usually translated as “wise”, was a popular title of the Türkic Kagans, including the famous hero of the Bilge-Kagan inscription. The root bil- with the semantics “able” found employment as a suffix -able in English innovations like suitable and doable, the last is a compound of two reflexes, the “do” and bil- “able”. Similar internalizations took place in other amalgamated languages like Lat, Cf. bil- “able” > Lat. -abilis, -ibilis ~ Eng. -able, -ible. Another prominent derivative is the English verb feel “feeling”, a 15th c. contraction of the Türkic derivative bilig/bilin-, see feel. Three words, bill (0.01%, rating 899), feel (0.13%, rating 174), and will (0.58%, rating 75) contribute combined, 0.72% frequency usage in English, without accounting for forms like billable, feelingly, willful, and palpation. That brings the total frequency usage of the Türkic bil-'s descendents to about one of every hundred daily words, or 5 words per page. With all the learning-related subjects at hand, the IE etymology did not get even a hint of any IE cognates nor a clue on the origins of the so dear to us existential and linguistic components of our accumulated cultural wealth. See -able, eligible, durable, feel, will.

English bode (v.) “augur” ~ Türkic bodi “insight, achievement of perfect wisdom”; the conventional rendering of the Skt. word is bodhi. Ultimately fr. a root büt-/büt, püt-/püt (v., n.) conveying mystical notions “believe, deity”, “pray, prayer”, “revere, idol, icon, cross”, “refer, truth, fact, verity”. The Türkic word belongs to the class of grammatically most archaic, distinguished by homonymic noun-verb pair and neutrality to transitivity. In English, time has erased linguistic traces of the past religions supplanted by the later doctrines, leaving behind only monuments of the old cultures. The term bodi is a Buddhist religious term of Türkic extraction, with vast spectrum of Indian derivatives: Buddha, bodimant “throne of enlightenment”, bodisatva (bodhisattva) “enlightened (bodhi) being (sattva)”, etc., few of them were absorbed into the Türkic syncretic lexicon. The first Buddha was a Türkic Saka extract Scythian “prince” Shakyamuni (6th c. BC), of the Saka Kurgan tradition, he brought a  Buddhist enlightenment to India. Cognates: A.-Sax. has a whole range of bodi- derivatives: bod “message, precept, preaching, command(ment)”, boda “messenger, herald, prophet, apostle, angel”, bodere “teacher”, bodian “foretell, announce, proclaim, tell, preach”, bodiend “proclaimer, teacher, preacher”, bodlic “command(ment), decree, ordinance”, bodscipe “message, command”, bodong “message, recital, preaching, interpretation”, bodungdaeg “Annunciation Day”; OSax. (gi)bod, Gmn. (ge)bot, ONorse boð; Balto-Sl. (Lith.) budeti “awake”, (OCS) bludet “be alert, follow”; Skt. bodhati “awake, watchful, observe”, buddhah “awakened, enlightened”, bud “Buddha” and “Mercury (planet)”, Av. bütaj “name of one of the demons”; Pers. but- “idol, fetish”, also “beloved, sweetheart”, pwt- “Buddha”; Kor. mitta, mide, midin (믿다) “believe, trust”; Far Eastern China and Japan received their terminology via Sogdian intermediaries; ditto Türkic, Mongol and Tungus-Manchu via Sogdians of the Türkic domains, that is attested by the form put-. A few centuries were needed for the Buddhist terminology to take root and develop in India. That points to the timing of penetration of the matured Buddhist formulaic terminology into the steppe belt. Sogd was an intermediary between the Indian, and Pers. and Türkic phylums. The expanded extent of the A.-Sax. lexicon attests to a long syncretic period and deeply ingrained use of the Buddhist terminology. It was carried by the Sarmat migrants of the Sarmat expansion period not earlier than the 3rd c. BC and not later than the first references to the nomadic Vandals Wendeln “Wonderers” at the turn of the eras. It was syncretically embraced by Tengriism, and spread from India to the western Asia and eastern Europe. The Gmc. cognates attest to a deep penetration of the Sarmat Tengriism into the life and etiology of the Central European (A.-Sax., OSax.) and Northern European (Gmn., ONorse) populations. The Lith. and OCS lexemes attest to the semantics and phonetics extant in the Eastern Europe in the 2nd c. BC. The penetration had to be directional, touching certain tribes and not reaching the others. For temporal and spatial reasons, more westerly archaic Scythians, Cimmerians, the Mesopotamia Kurgan nomads, the northern Tele tribes, and and the archaic Zhou in the east could not have been affected by Buddhism. Those tribes would have preserved the archaic etiology of Tengriism and the archaic religious terminology, missing on the Buddhist formulaic bodi and its numerous derivatives including buddha. That lopsided penetration is reflected in the dictionaries, the OTD succinctly lists a series of Buddhist lexemes, and the EDT ignores them altogether. The influence of the Roman world in disseminating Buddhist terminology to the N. Europe needs to be ruled out, for two reasons: the spread across the Roman world Mazdaism was in conflict with Buddhism and could not have adopted the Buddhist lingo, and the Roman religious influence on the outside world was non-existent. The Romans neither mastered Buddhist terminology nor were in a position to spread it. That again leaves only the Sarmatian part of the Scythian world to carry Buddhist terminology to Europe. The IE etymology suggested an origin linked to OIr. buide “contentment, thanks” barely related to the notion of “augur, enlightenment”. It may be a reuse of a Celtic Kurgans' semantics of the 5th mill. BC before their departure on a circum-Mediterranean anabasis of the 4th mill. BC, separated from its siblings by combined 8,000 years timespan. Otherwise, the suggested PIE and PG etymology belongs to the genre of petty scientific fiction. Although superficially it turned to Skt. for the IE prime root, it completely missed the later Buddhist content of the terminological nest. The semantics of bod- “augur, insight, enlightenment” came from the 6th c. BC Indian soil. Migration of the Buddhist enlightenment to the foggy Albion and its environs, instead of being miraculous or accidental, jibes with the historical canvas on the known migrations of the Celtic Kurgans, Kurgan Sarmats, with the Buddhism's syncretism with Tengriism, and the Gmc./Scandinavian innate religion of Thor of the 1st mill. BC. The return of the mutated Indian term to the Eastern Europe has not caught on because Balto-Sl. languages have already developed their own synonymous lexicon based on the stem ved- “to know” (Cf. Sl. veschiy (âåùèé) (adj.) “prophetic, enlightened”, etc.). It also did not infiltrate the Fennic population of the area. The A.-Sax. term bod- affords a unique opportunity to trace physical and cultural migrations across Eurasia that link the Indian subcontinent with the Atlantic seaboard, it is a unique and credible marker that propagated along the Eurasian steppe belt in the course of the sequential Türkic migrations.

English bore (v.) “drill a hole” (v.) (Sw N/A, F1969, 0.00%) ~ Türkic bur- (bür-) (v.) “twist, wind round, screw”, a subset of a generic meaning “twist, twirl, spin” applicable in non-bore related sense, like twisted vines, etc. Ultimately fr. verb bur-, bür-, buz- that starts with “ruffle, pleat” and is consonant with an antecedent bu- “steam”, probably referring to eddying steam. The meanings of all three stems are both overlapping and divergent. Various allophonic forms include ebir-, egir-, evir-, evür- with a front prosthetic vowel and initial p- (b > p), Cf. “puff”. The r/z alternation is equivalent to r/s alternation, it attests to a most archaic origin. Cognates: A.-Sax. bor “borer, gimlet”, borian “to perforate”, (wede)berge “(helle)bore”, ONorse bora, Sw. borra, OHG boron, MDu. boren, Gmn. bohren, “drill”; Lat. forare “drill”; Sl. allophones of br-, vr-; Russ. bur (áóð) “auger”, buravit (áóðàâèòü) “drill”, vertet (âåðòåòü) “spin”; Bosn. buše-, Serb. bush- (áóø-) “to drill”; Alb. brime “hole”; Fin. porata, Est. puurida, Hu. furni “drill”, forog “rotate”; Baltic languages have their own words; Celtic languages have their own words (e.g. allophones of “drill”); Mong. burgi “eddy (smoke, steam, dust”, buran “tornado”; Even mori “pucker”; Manchu foro “revolve”, forgon “rotate”; Nanai poro “revolve”. Distribution is very specific, NE European limited to a geographically confined area, Eurasian including and adjacent with the Eurasian Steppe Belt. That is consistent with the Eurasian distribution of other words of the Türkic origin. See wrinkle.

English borrow (v.) “receive temporarily” ~ Türkic borč, borğ, borığ (n.) “debt, indebtedness, obligation, loan, borrowing”. Ultimately a deverbal noun derivative fr. the verb be:r- “bear, give”. A deverbal noun suffix is one of the most productive suffixes, same with -ɣ/-ïɣ (ğ/ığ). The spelling bert conveys the same in a passive form, “something given”, like a newborn. The European cognates are limited exclusively to the Gmc. group, attesting to a non-IE origin. Cognates: A.-Sax. beran “bear, carry, bring”, boren pp. of beran, bringan “bring”, borg/borh “debt, pledge, security, obligation, debtor”, etc, with a significant set of derivatives and extensions; ONorse borga “bail, guarantee”, Goth. brahta “brought” (i.e. “brought to” with the Türkic directional suffix -ta), OHG brahta ditto, MDu. borghen “protect, guarantee”, Gmn. borgen “borrow, lend”; Sogd., Syrian porč “debt”; possibly Mong. buru “guilt, responsibility”; Kor. bij (빚) “debt”. Besides Gmc. languages form borğ/borg/borığ (with -ğ/-g) is shared by Azeri, Chagatai, Crimean Tatar, Ottoman, Turkish, and Uigur among the Türkic family. Allophones of the specifically Turkish form bordž and its derivative bordžlu were internalized across South Slavic, Romanian, Albanian, and New Greek languages. Geographical distribution extends from Atlantic to the Far East and across linguistic groups. The focus of the Türkic -ğ/-g form centers on the Aral-Caspian area, consistent with numerous other Türkic-Gmc. lexemes. The word has a notably rich range of phonetic forms, attesting to a long history of circulation and plenty of amalgamation. The distribution, however, pointedly avoids numerous linguistic families and groups, showing local attachment to a native word. With the weight of the complex and overlapping diagnostic evidence, the simplistic phonetic knockoff of a “PIE root” *bhergh- “hide, protect” is perfectly ridiculous. The combined evidence of carryover of the prime stem ber- “bear” and its derivative borg lit. “something carried over, given” present a paradigmatic transfer case that definitively attests to their origin from the Türkic milieu. That evidence is corroborated by a sister case of paradigmatic transfer, the “birthing” triplet terminology ber-, döl-, and ken-, passed from Türkic moms to their daughters and granddaughters, unambiguously attesting to the innate Türkic roots. See bear – Preliminary Note, bear ((v.).

English bother (v., n.) “trouble” (Sw N/A, F1073, 0.01%) ~ Türkic buša-, busa:-, bušan- bosa:-, busan-, (v., n.) “grieve, sorrow, alarm, irritation”. Cognates: A.-Sax. bot “help, relief, advantage, remedy”, Gmn. busse “concern, worry”, beschwerde “trouble”, Dan. besvær “trouble”, Du. bezorgdheid “concern”, beroeren “trouble”; Ir. bodhraigh “bother”, Gael. bodraigeadh “bother”, Welsh boeni, poeni “bother, worry”; Fin. vaivautua, vaiva “bother, trouble”, Hu. baj, banat “trouble”; Ru. buza, buzit (áóçà, áóçèòü) “trouble, nonsense”. The A.-Sax. bot is related to the derivatives bottan/betan “fix, amend, cure (trouble)”, botettan “improve, repair”, botleas “unfixable”, and so on, connected semantically with a contiguous process of recovery. IE etymology is non-existent, no IE cognates whatsoever, no “restored IE proto-word” to go around. A suggestion of Ir. physical “deafen” applied to the emotional “trouble” is unsuitable both semantically and phonetically. The word must have lurked somewhere in the English folk language until it popped out sometime in the 18th c. With the frequency rating of 1073, it has a decent status in English, ahead of such prominent words as “island”, “broken”, “advice”, and “paid”. It was lurking right under the surface. As a probable Celtic contribution to Eng, it parted with the E. Europe ca. 5th-4th mill. BC, creating a 5-millenia phonetic/semantic gap with its less adventurous Asian siblings. The cognate distribution across linguistic families, and its ingrained presence within the Celtic languages attest to a deep antiquity of the underlying emotional notion “trouble, grief” predating the ancient Celtic migration.

English bruise (v., n.) ~ Türkic bürt (n.) “contort, convulse, sprain”, bert, pert, mert (n.) “injure, hurt, bruise, sprain, break, cut, hack, incise, wound”. The Eng. form with an elided vowel leaves that vowel obscure, -e- is likely. These two words with similar semantics attest to a common origin that resulted in wide spectrums of forms and semantics, all connected with process of inflicting injuries (war, wrestling) and the resultant injuries (hurt, bruise, sprain, break, wound). Cognates: A.-Sax. brysan, brysian (v.) “bruise, crush, pound”, brysednes (n.) “bruise, crush”, Goth. brakja “strife”, brikan “break, quarrel”, Anglo-Fr. bruiser “break, smash”, OFr. bruisier “break, shatter”; OIr. bronnaim “I wrong, I hurt” complete with the Türkic suffix -im denoting “me”, Breton brezel “war”, complete with the Türkic passive suffix -l; VLat. brisare “break”; Sl. has a slew of related br- words: borba (áîðüáà) “wrestling”, bron (áðîí) “war”, with numerous extensions and derivatives. The transition from -t to -z may be a result of interdental -t- (-th-), not rendered in the reference sources and receptor languages because of peculiarity of the phoneme. The phonetics falls into quite reasonable margins, the semantics is perfect. The IE etymology resorts to unattested forms starting with br- “smash, cut, break”, unverifiable variations derived from the same late forms with a limited sampling range. Since Lat. did not have the br- root, the presence of late VLat. and OFr. cognates may point to a Burgundian origin.

English bulge (v., n.) “protrude” ~ Türkic beleg (n.) “wrapped, wrapping (gift, etc.)”, a derivative of the verb bele:- “bale, swaddle, wrap”, a derivative of the verb ba:- “bind”. The verb ba:- is extremely productive across Eurasia, it left its traces from Atlantic to Pacific, with allophones “bag” and “pack” most widespread. The origin of the noun beleg is reflected in its main application “to swaddle a baby”. The verbal and adjectival forms probably emerged with the noun. The Eng. verb bulge is a semantic extension of the noun bulge “pouch”. Cognates: Eng. bilge “bulge (ship)”, bulk (v.) “bulge, swell”, belch “swelling burst”, pouch “sac”, A.-Sax. bulge (n.) “bag, wallet”, belg (n.) “bag, purse, leathern bottle, bellows”, balic- “encompass” (lit. “bind-like”, bealuelomm “bond (oppressive)”, baalurip “fetter (oppressive)”, pocca, pohha, poh(ch)a “bag, pocket”, and quite a few more; Goth. balgs “bag, wallet”; OIr. bolg “bag”, Breton bolc'h “pouch (pod, of flax)”; OFr. bouge, boulge “pouch, leather bag, wallet”; Lat. bulga “leather sack”. It could be reasonably suspected that Lat. did not invent the word and spread it to the Goths and the Turkic people all the way to Manchuria. The IE etymology that ascends the Eng. form to Lat. is utterly misguided in its beliefs, and its etymological reference to “budget” and “belly” is equally out of place. The semantic extension from “bind” to Eng. “bulge” appears to be a local innovation, distribution of the derivative bulge “protrude” and underlying bulge “pouch” is consistent with the Sarmatian Kurgans' spread to the C. Europe simultaneously with the emergence of Rome as a colonial power in the Apennines in the 1st mill. BC, but the OIr. and Breton forms point to a possibility that the word was carried to the European soil by the Celtic circum-Mediterranean migration in the 3rd mill. BC, and by the 1st mill. BC it was deeply internalized on the European scene. See bag, bale, band, belt, bundle, burg, pouch.

English bunch (v., n.) “gather into cluster” (v.), “large number” (n.) (Sw N/A, F1228, 0.01%) ~ Türkic bunča (buncha, adv.) “so many, so much”, from Türkic stem bun-/mun- + equitive adverbial suffix -ča. A normative noun form of the stem is bunaz/munaz. Notably, the English word bunch, as well as the form much, have preserved the Türkic equitive adverbial suffix -ča. The concordance is perfect, except for expected grammatic shift (in Türkic, grammatical definition is done by agglutinating suffixes, so any part of speech can be formed from a single stem; in early English, suffixes are dropped, and grammatical function is primarily defined by the word order structure. No IE etymology. The only suggested “possible” cognate is a Flemish boud “bundle”, a transparent cognate of the Türkic bandur- and English bundle. Predictably, because of dialectal m/b alternation, bunča has twins munča (muncha, adv.) and mïnča (myncha, adv.), with closely related meanings: “such a number of, so many, so/thus”, which have found their way into early English and English. Accordingly, English has much, and Spanish has mucho, with no IE connection like the Eng. bunch. See bundle, much.

English bundle (v., n.) “gather into cluster” (v.), “cluster” (n.) ~ Türkic ban-, ba:n- (v.) “bind, tie”, a reflexive and passive of ba:- (v.) “bind, tie”, bandur- (mandur-) (v.) “bind, tie” causative of ba:-. The form bundle is a predictable deverbal noun innovation from some version of bandur-. Cognates: A.-Sax. bund “bundle”, bindan “tie, bind, fetter, fasten, restrain”, etc., MDu. bond, bondel, binden “bind”, Gmn. bündel ditto; Sl. (Rus.) bant, bantik (dim.) (áàíò, áàíòèê). The Gmc. forms carry versions of the Türkic stem complete with the Türkic suffixes -t, -l and -en, a noun, passive and reflexive markers respectively. The limited and peculiar distribution covers an insignificant number of the IE languages. The claim of a “PIE proto-word” is totally incredulous. The suggested IE etymology of some aspirated form of “PIE” *bend- “bend” is candidly irrational, unwittingly it is a clone of the Türkic verbal form bantı “bind, tie” a derivative of ba:-.

English bust (v., n., adj.) “ruin completely, break into pieces, apart”~ Türkic bastur- (v., n., adv.) “crushed, subdued, suppressed”, a causative form of the verbal stem bas-/ba:s- “press, suppress, overcome, seize (rape)”, see push. The panoptic verb numbers 30+ meanings related to applying downward pressure. A general connotation is a definitive “final, end” of some action. The expression “finish (something) off” corresponds to a calque of the Türkic bastur-. Cognates:: A.-Sax. basnian “wait (in ambush)”, Eng. (1530s) baste “beat, thrash, finish off”, OHG besten, Sw. basa “to beat, flog”, bösta “to thump”, ONorse beysta “to beat”, all with connotation “end, kaput, finish, busted”; It., Sp., Port., but not Lat. basta “end, finish”; Gk. bastazo (βαστάζω); Sl. (Pol., Rus.) basta “end”, Rus. bastryk “subjugation”, Pol. basalyk “lead-ended whip”; Arab. basma “printed fabric”, “stamp, brand, imprint”; Mong. basu “quit”, Kor. matta, mağa “break, crush”; Tungus, Manchu basala- “kick” The form basting “strike violently” semantically and phonetically closely parallels the Türkic bastïq “to be defeated, broken, suppressed”. Possible cognates may be Bastarnae, a tribe at the end of the domain, neighbor (Ptolemy), bastion “tip, at the end” with its derivatives, via Fr. No IE cognates, in Europe originally a peculiar areal distribution, the suggested derivative distortion of burst (barsten, bærst, bersta, berstan, borsten, brestan, bersten) “erupt, rupture” is too far fetched, does not fit neither phonetically nor semantically. The suggested derivative of beat is not any better. The word known in England in 1530s suddenly popped up in 1750s in the US, and became a popular economic and action term. See ambush, bat, pat, push.

English butt (v., n.) “thick end” (Sw N/A, F1415, 0.01%) ~ Türkic büt-, bit- (v.) “come to an end, end, end up, befall”. Cognates: A.-Sax. buttuc “end, small piece of land (probably a tip of a lot)”, MDu., Du. bot, LGmn. butt," ONorse bauta, ONorse butr “short (end)”; Ir., Gael. butt; Welsh fytiwyd; Fr. bout; Sl. (Rus.) bodatsya (áîäàòüñÿ) “butt, gore”; Bulg. bitisam “end”; Serb. bitisati “pass, die”; Mong. bütǝõǝ “complete”, bütxǝ “finished”. The Türkic system of suffixing allows creation of vast volume of derivations, eg. bütün whole, entire; bütmäk ending, termination, fulfillment, etc. So is its use in English: butt in, butt out, button, buttress, butt-weld, button, buttock, and so on. The IE etymology is badly lost, vague, and incidental more than is warranted, observing incidental conflations (button - bud) and conflating unrelated (button - Fr. boter “to thrust”) words. Or connects it with “digging”. The age of the lexemes is indicated by proliferation of the Türkic suffixes, pointing to the formation of the words like button still in the Türkic linguistic milieu: suffixed -on, -ta, uch, etc. The original semantic association with the human butt is attested by the Türkic derivative form bütgü “baby excrement” (sounds much like the modern “butt goo”). The distribution of the word, from Mongolia to Atlantic, and evading the Asian IE areas, makes etymological association with the Kurgan Sarmatians unavoidable. The Celtic fytiwyd may be an echo of the original büt-.

English cackle (v., n.) “hens shrill squawk” ~ Türkic kakı:la:-, kakı: (v., n.) “cackle”. Cognates: none besides Türkic languages, only English; no surviving Gmc., European, or IE cognates. The Türkic denoun verbal suffix –la had internalized into an Eng. auslaut –le for both verbal and noun forms. The phonetic and semantic match is perfect. Chance of pre-historical borrowing fr. English into the bulk of the Türkic languages is absolute zero. No IE etymology, suggested “imitative” is probably true, but is useless in respect to the familial origin of imitation: nowhere else the chicken squawk is imitated by a cackle-type form. This word belongs to the lexical cluster of Turkisms that was inherited by English independently fr. other Gmc. languages, and may be a useful marker for linguistic and demographic tracings.

English cagy (v.), cagey (adj.) “evasive, reticent” ~ Türkic qač (qach) (v.) “avoid, shun, escape, disappear”. The form qač is related to the allophonic basic notion köč/göč (n.) “coach, carriage, wagon, house”, (v.) “ride a coach”, “migration, travel”, ultimately fr. the Türkic base qo/qon “coach, migrate”, see coach, come. Türkic qač has numerous derivatives, due to the character of surviving records a best recorded is “slip away, escape” from enemy. Cognates: Finno-Ugrian correspondences, Rus. dial. qočqa (êî÷êà) “escapee”; adding cognates for “coach” and its siblings would convert a miniscule listing to a massive case, from Sw. kusk to Ch. Gaoche 高車. No IE parallels, no surviving Gmc. parallels. The Türkic origin is semantically pinpointed and phonetically precise. See coach, come, go.

English call (v.) “verbal touch” (Sw N/A, F162, 0.16%) ~ qol- Türkic (v.) “call for, ask, beg, pray”. The Türkic stem qol- is apparently a derivative of the noun qulaq “ear”; that is corroborated by Hu. cognate hal “hear”. Over the ages, the verb has developed a respectable derivative tree. For the verb, English has an assembly of 25 semantic meanings. Germanic forms with auslaut -g are allophones of the Türkic deverbal noun qolɣ formed with a suffix -g-/-ig-/-yg-. Cognates: A.-Sax. ceallian “to call, shout”, Du. kallen “to talk”, ONorse kalla “to cry loudly”, OHG kallon “to call”, kalzen, kelzen “talk, brag”, klaga, Gmn. klage “complaint, grievance, lament, accusation”; OIr. kalla “calling, singing”, Welsh galw “call”; Cimr. galw “calling”; OCS glasit (ãëàñèòü) “say”, glagolit (ãëàãîëèòü) “speak”, klikat (êëèêàòü) “call”; Gk. klisi (κλήση) “call”; Skt. garhati “bewail, criticize”. Another cognate of qol-/call- is the noun and verb gloss: A.-Sax. glesan “to gloss” MLat. glossare, LLat. glossa, OFr. gloser, “speak nicely”, lit. “say it, explain voicing”. The IE etymology recognize it for Cimr. galw and OCS glasit, but not for the MLat. glossare manifestly confused etymologically with the notion of “thorn”. The word has been around for quite a while: the Cimmerians of the 10th c. BC are youngsters against Skt. of the 16th c. BC and the Celtics of the 28th c. BC. Against their dated counterparts, the Germans are late arrivals. What unites these diverse people is that they are all living on the outskirts of the great steppe, bordering, occasionally amalgamated, and at times being the original Türkic tribes. The word is non-IE. Most of the IE languages do not have parallels, and those few that have have historically or biologically documented Türkic links. The IE etymology, both English and Slavic, in O. Maenchen-Helfen's favorite expression, is pure galimatiaa, piling up all allophones in one uncouth heap. In a feat of paradigmatic transfer, Eng. possesses four main action words related to communication: say, tell, call, and gabble, the direct siblings of the Türkic söy-/söyle-, til-/tili-, qol-, and gap-/gapir-. These are in addition to the cognate words for animal utterances and other sounds. Although overlapping and interchangeable to some degree, each one conveys its own distinct spectrum of very basic communicative notions. The Türkic origin of the word is irrefutable. See gabble, say, tale, tally, tell.

English calm (v., n., adj.) “quiet” (Sw N/A, F804, 0.01%) ~ Türkic kam-, ka:m- (v.) “weaken, lower”. Semantics depicts a change in intensity (i.e. wind died, heat or cold diminished, sea calmed, etc.). Of the word's five semantic clusters, the above cluster is the oldest, the other four are derivative. The phoneme -l- in the stem probably simulated a long -a- also rendered as -aa-. Cognates: A.-Sax. col, coll “calm”, cama “muzzle”, Norse calm, OFr. calme, carme, OIt. calma “tranquility, quiet”, Sp. (and other Romance languages) calma; Malt. ikkalma, Sl. (Ukr.) (v)gam(uvati) ((â)ãàì(óâàòè)), (Russ.) (u)gom(onitsya) ((ó)ãîì(îíèòñÿ)); Est. külm(avereline); Uzb. jim, Az. həlim; Mong. (nam)gum ((íàì)ãóì), all “calm”. The etymologically cited LLat. cauma “heat time (siesta)”, Gk. kauma “heat”, kaiein “to burn”; spelling influenced by Lat. calere “to be hot”, are unrelated to the notion “calm”. To the detriment of its own objectivity, the circular IE “etymology” omitted numerous calm-type cognates scattered across Eurasia. In the last LLat./Gk. example on fabrication of an IE “etymology”, the only savvy element is its ingenuity. The idea of the Romans carrying their “hot siesta” to the fringes of the Mongolian desert to express “quiet” extends far beyond absurd waiting to be quietly rescinded. A.-Sax. had plenty of native words for “calm” with a rich complement of derivatives, attesting that “calm” was a demographic oddball: acelan, astilllan, bliðe, fedlice, and more. Many of them are still active in English with their prime semantics. For a notion “calm sea” A.-Sax. already carried native dumb, cool, still, tame, blithe, fade, mellow, row, soft, plus now lost swig “silence”. Apparently, the forms for the notions “calm” and “cold” were mixed up, resulting in ambiguous col/coll “cold, calm”. True cognates are scattered across Eurasia, across linguistic groups, with European scene a minor remote appendix. In a sane world, a notion of “sea on siesta” for the “calm sea”, and the like dotty leaps of imagination would not make sense. The Greeks did not need to derive “quiet” from “burn”. Distribution of cognates is peculiar and suggestive. First, the word does not belong to the IE family, is an oddity among the mass of the IE languages, and is peculiar only to a handful of specific European languages. The “PIE” etymology is utterly unsustainable. Secondly, the Romance group probably received the word via Burgund nomads, in the second part of the 1st mill. AD. Thirdly, all cognates are distributed among peoples located along or adjacent to the steppe belt, and known to have horsed nomads in their midst during historical period. Phonetically, the Azeri form is a first-line candidate for dissemination, it is connected with As-eri Scythians (Ishguza/Ashguza Scythians in Sakasena, modern Azerbaijan), Ashkenazim Scythians (אשכוז’ škuz and אשכנז’ šknz, Hebrew, Biblical records) and with the Scandinavian folklore of the latter days. Phonetically, most European cognates introduced an inlaut -l-, while the Sl., Mong., and Uzb. forms, and the modern English with a silent -l-, do not use it. The absence of cognates in the Celtic languages allows to suggest that the form kalm-/kam- evolved after 5th mill. BC, after the Celtic departure from the Eastern Europe.

English calumniate (v.) calumny (n.) “malicious charge, trickery, subterfuge, misrepresentation” ~ Türkic čîlvu, čólvu (v.) “defame, disparage, libel”. Ultimately fr. a verb čîl- “weaken”, and its derivatives related to “damage”, Cf. čolmaq “vice, defect, disable”. In the Manichaean Christianity lingo, and with the advent of Islam, it came to designate blasphemy and became a popular daily mantra. Cognates: A.-Sax. hol (n.), Goth. holon, holian (v.), OHG huolian; MFr. calomnie “slander, deceive”; Lat. calvi (v.), calumnia (n.), Gk. kelein “bewitch, seduce, beguile”; Sl. allophones OCS, Serb., Croat, Bulg., Czech, Slovak, xula, xoula, etc. “slander, deceive”. The Lat. form calvi “to trick, deceive” matches exactly the Türkic čîlvu in phonetics and semantics, leaving no doubt on the origin of the word, the Lat. calumnia (n.) is a Lat. derivative of calvi, it produced MFr. calomnie, and Eng. calumny. There is no need for an unattested “PIE root” *kel- “conceal”, the Türkic word čîlvu is still active, its numerous allophones attest to its geographical, temporal, and dialectal diversity. The ONorse hol “praise, flattery” saliently falls out from the semantic uniformity across numerous languages and timespans, it may be a phraseological caveat for “unflattering”. The transition Türkic čîlvu > Lat. calvi > Lat. calumnia > MFr., the A.-Sax. holian (v.), calomnie > English calumny clearly demonstrate the paths and directions of the phonetic shifts within extremely narrow semantic field. The ubiquity of the word in time and space attests to the ubiquity and importance of the calumny in the life of numerous societies from at least the 1st mill. AD to the days of enlightenment.

English can (v.) “able to” (Sw N/A, F51, 0.41%) ~ Türkic qan-, ka:n- (v.) “happen, occur, meet a desire”. The verb has at least 23+ meanings in 5+ clusters. A main cluster denotes a notion “can (execute, achieve a desire)”. A secondary cluster denotes a derivative notion “know”. That created a bifurcated semantic split of “can” and “know”, see know. For uninitiated etymologists, situation is confusing and confused. What appear to be homophones are two offshoots of a single parental verb. The Gmc. cognates are semantically split, and are unknowingly treated as homophones. Cognates: A.-Sax. can, con “can, be able to”, and “learn, know”, ONorse kenna “to know, make known”, OFris. kanna “to recognize, admit”, Gmn. kennen “to know”, Goth. kannjan “to make known”; Mong. qan-, qanu- “meet a desire”. More cognates are provided by derivatives in a range of languages, Cf. Türkic qantar, Manchu qantara “impatiently wait (desire)”, Türkic qantar “caprice, whim”, “tilt, twist”. Distribution of the can “able”, peculiar to the Germanic and Türkic languages, proves its linguistic affiliation. Versions of the form and semantics cant “tilt, twist”, rated as “a word of uncertain origin”, are found in many languages: Teutonic, Slavonic, Romanic, Celtic, Latin cantus (OED). In the east, the older notion “can” retained its frozen in time primacy. In the west, A.-Sax. carried the conventional bifurcated semantics eidetic with the Türkic original. A common trend starting with the Goth. was a Gmc. shift to the semantic prong “know”, while the older words haban “have, possess, hold” and magan/mahteigs/mahtig “might” filled the function of “can, able”. The near-perfect phonetic and perfect semantic congruence attests to the common origin of the Eng. word can “able”. Like the can “able” and can “know” separately, and the A.-Sax. pair “able + know”, the magan “might” came from the Türkic linguistic phylum, an authentic case of paradigmatic transfer to the Gmc. languages. See -able, I, do, know, might, this.

English capture (v., n.) “take” ~ Türkic kaptur, kapdur (v.) “seize, embezzle”, causative form of the verbal base kap-, qap- (v.) “seize, grab”; qapsa- (v.) desiderative form of kap-, qap- “surround, encompass all sides”; hapset-/hapis “capture (v., n.)”. Ultimately a verbal form of kap “bag, pouch”, later “vessel, cup”. The notion “capture” ascends to traditional Türkic methods of encircling hunt. In addition to the direct “take” from different angles, the verb produced passive and metaphorical extensions: “trapped”, “catch (an infection, an idea)”, “bite”, etc., selectively captured by the receptor vernaculars. Cognates: A.-Sax. hæft “take”, hæftling (n.) “taken”; OIr. gaib “grasping”, gabaim “I take, grasp”; Lat. captura, capere “take”; Sp. capish “capture (the meaning)”, fr. Türkic kapıš- “capture, understand”, a reciprocal and co-operative form of kap-, qap-; Sl. hapat- (õàïàòü) “grab” with dialectal variations; Hu. kap-, kapni “grasp”; Finno-Ugrian languages also have correspondences (Nemeth, 1928); Arab. qabada; Mong. cognates were unjustifiably rejected by G. Döerfer; Mong. Kalm. xawl- “catch (ball), bite”. The Eng. capture and Lat. captura and captus “taking” (especially of animals) complete with Türkic causative/active suffix -tur attest to immediate inheritance from the Türkic. Distribution covers a handful of IE languages, cuts across linguistic families, and positively attests to a non-IE origin. Geographical distribution and ancient forms indicate an ancient Turkic lexicon in the area of the Italic, Gmn., and the Slavic branch of the Baltic languages, possibly of Hunnic period, with Lat. ascending to much older acquisition. An IE etymology is non-existent, a faux “PIE proto-root” *kap- “grasp” solely parrots the attested Türkic kap-, qap- (v.) “seize, grab”. A hardline M. Vasmer in desperation suggested an unattested faux “IE proto-word” *-khar to parrot some eastern articulations. Phonetic consonance is striking, and semantic match is perfect. Türkic has numerous terms and derivatives with wide geographical spread of the terminology related to encircling hunts, ascending to the ancient hunter-gatherer society. The anlaut reflexes h-, ñ-(k-), ñh-, õ- indicate transmission of Turkic initial glottal h- (q-) with dialectal variations, and the Gmc. -ft may reflect the original Turkic form hapset-/hapis presently spelled -pset in Romanized transcription. The OSl. form hapyashte preserved the Türkic deverbal noun suffix -č. The OIr. gabaim came complete with an agglutinated Türkic suffix -im “me”. The hunting word qapsa “encircle, surround on all sides” is paradigmatically connected with the English “circle”. See circle.

English care (v., n.) “attend, be concerned, safeguard” (Sw N/A, F216, 0.09%) ~ Türkic kör-, gör-, qara- (v.) “look after, care for” . Ultimately fr. a kVr-/kVs- type prime verbal cluster variously articulated k-/g- anlaut and ö/ü/a vowels, connected with aspects of vision. A branch of verbal allophones kör-/gör-/qara- relays a prime semantics “look (attentively)” (Cf. stare, gaze, see), and semantics “care for” among various downstream extensions. The form qara- of the cluster is validated by a derivative word qaraɣu “watch, patrol”. A complimentary word of the same semantic branch is sorrow, scare of the Gmc. linguistic trunk, see scare. The Eng. idiom “look after” is a calque of the Türkic expression. Cognates: A.-Sax. carian, cearian “care for”, carseld “home of care”, OSax. karon “to care, to sorrow”, OE. cearu, Goth. kara, OHG. chara “care, anxiety”. OED erroneously emphasizes that albeit it is a suitable match semantically and phonetically, it is in “no way related to the Lat. cura “care, concern, trouble”. Instead, OED should emphasize that it is in no way related to the notion of “scream”, and any other subject semantically unrelated to care, there is no need to conflate phonetic resemblances, like an unattested “PIE root” for “cry out, call, scream” fr. Ir gairm “shout, cry, call”, Eng. garrulous “chatty, talkative”, Du. karig “scanty, frugal”, OHG chara, charon “to lament, wail”, Gmn. karg “stingy, scanty”, and the like. They all deserve and have their own cognates. It would be honest to declare a routine “of unknown origin”, with no IE connections. Türkic has two phonetically close distinct siblings with close, but different semantics, a qorq “be concerned, safeguard, be afraid for” and gӧrg “scare”, which apparently conflated in different phonetic structures of different languages. The A.-Sax. carig “sorrowful, anxious, grievous” is a NW European form of the Türkic qorq “scare, fear, panic, horror, phobia”, and Gorgon is a Greek Mediterranean form. See caginess, gaze, Gorgon, scare.

English carve (v.) “cut by chipping away at a surface, engrave” ~ Türkic kert- (v., n..) “incise, carve a mark, engrave”. A.-Sax. ceorfan, cearf, corfen “to cut, cut down, slay; to carve, cut out, engrave”, OFris. kerva, Du. kerven, Gmn. kerben “to cut, notch”; Gk. graphein “to write”, originally “to scratch” on clay tablets with a stylus. The A.-Sax. cweorn “millstone”, i.e. “scratch pad”, cweornbill “chiesel”, quern, and their European cognates probably are remote derivative siblings of the kert. All various forms are consistent with the Türkic kert phonetically, and perfectly consistent semantically. The auslaut consonant variously takes forms -t/-f/-v/-b/-ph, probably adopting consonant to the local phonetic conditions. See cut, curt, short.

English cast (form) (v., n.) “form in mold” ~ Türkic qïsdï (qysdy) (v.) “form in mold”, a form of the verb qïs- “squeeze, press, compress, clinch, force, coerce, restrain, crush, suppress”, -dï is predicate and analytical formant. Cognates: Du. giet-, Sw. gjut-, Gmn. guss; It. ghisa; Gk. chytos- (χυτοσ-); Balt. (Lith.) ketus, all “form in mold”; Mong. kisa “furnace”, Bur. xyaha “furnace”, Kalh. xyas “crucible”; Manchu xuža “furnace”. Since Türks did not invent a bronze metallurgy but carried its technique from Carpathians to around the Eurasia, they are an only candidate to bring it to the Far East (Cf. documented Zhou metallurgy to China) and to the Far West. The future Germans of the Corded Ware were on the far fringes of the main trade routs connected with the spread of metal casting, and probably learned its terminology from some intermediaries, probably of Türkic origin. English has retained a trace of the suffix - in the form -t; the others contracted -st to -t, or retained the stem qïs- (Gmn., It.). Various European spellings of the anlaut consonant reflect attempts to depict the uvular consonant. The It. form differs from the Lat.-Romance model, it is likely of Gmc. provenance. The IE etymology confuses homophones cast “form in mold” and cast “throw”, and even then can't come to a rational origin. The duo of cast (v.) “send forth” and cast (v., n.) “form in mold” demonstrate a remarkable case of paradigmatic transfer and creolization, where an entire semantic group (qus-, qïsdï) is transferred in its entirety while phonetic simplification created homophones out of phonetically distinct originals. The oldest Türkic castings and casting lingo were carried by the “Zhou Scythians” via Mesopotamia to the Altai and on to China, bringing ca.1750 BC the casting technique, including the monetary knifes, to the Shang China. The word probably originated in the 4th mill. BC not too far from the Carpathians, the center of the European metallurgy, was spread far and wide by the horsed nomads who brought it first to Altai and then to the Far East, and was carried to the Central and eventually to the Western Europe by different horsed nomadic groupings. Neither the Celts nor the Sumerians would have known the term, they left N. Pontic too early, but the Mesopotamian horsed nomadic Guties in the 3rd mill. BC already mastered a perfected casting of socketed joints.

English cause (v., n.) “origin of something, producing an effect, motive (Sw N/A, F357, 0.04%) ~ Türkic köze:- (v.) lit. “stir up burning embers”, “get charred” (see char), used mostly metaphorically. Köze:- is a denoun verbal derivative of kö:z/köre “embers, burn”, a derivative of the verbal root kö- “burn”. It phonetically echoes derivatives kön-/köy- “burn” and kol “coal” (see coal). The essence of the notion is “conflagrate, stir up, inflame, foment, instigate”. Cognates: Ir. cuis, chuis; OFr. causer “to cause”, MLat. causa (n.) “cause”, incaendefacio “torch”; Latv. celonis; Maltese kawza; Gujarati kojha (કોઝ); Mong. shaltgaan (øàëòãààí). The last three words, the Latv., Gujarati, and Mong., appear to be derivatives directly from the word kol for “coal”, in line with semantic connection and skipping the “embers” stage. The IE etymology is a routine “of unknown origin”. Distribution of the word indicates both colloquial and global character, with spotty appearance across Eurasia. The EDT cites only 3 groups that use this verb: Turkmen (i.e. Aral basin, consistent with other geographical associations), Khakass (i.e. Enisei Kirgiz, another link consistent with other associations), and Koibal, a small off-Khakass branch of recent provenance, along with other scatter from Atlantic to Baikal. Historically and philologically, such “rare” words may be very helpful in diagnosing genetic origins not only of the English people, but also of the Ephthalite Huns (Gujaraties) and Latvians. The Ir. form with /sh/ could originate from few closely related words: kol “coal”, köz “embers”, and the like. Paradigmatic transfer of etymologically related triplet, a metaphorical cause and substantive char and coal indelibly attests to a common genetic origin from the Türkic milieu. See char, coal.

English challenge (v.) “confront” ~ Türkic čalïš (chalish) (v.) “call to fight”, challenge to a hand to hand match. Ultimately from a stem čal-, talq- “shove, thrust down” with cooperative/reciprocity suffix -ïš, lit. “fight together”. Challenge “a call to fight” is recorded in English from 1520s, it is eidetic to the Türkic čalïš, in the modern lingo it corresponds to the bar-room invitation “let's step outside”. In the Türkic traditions, wrestling and single combat were a crucial part of any festivity. In most European languages, “challenge” is rendered as some local form of “call”, Cf. Sl. vyzov. Cognates: A.-Sax. clepe, beclepe, clipian “challenge, lawsuit”, lit. “call”; OFr. chalonge “challenge, call to fight”. A paucity of cognates provides an ultra slender path for diagnostic tracing leading to a similarly slender distribution on the other end. Interestingly, Ir. dushlan echoes the Türkic dushman “opponent”, Gael. dubhlan is very similar, and the Welsh term (herian) is akin to “manliness”, Cf. German “manly, brave, strong”. A suggested “IE” etymology of the word is unconvincingly frivolous: etymology is ascribed to Lat. calumnia “trickery” via VLat. calumniare “to accuse falsely” (see calumniate) with a brave jump to unrelated OFr. chalonge “challenge”. Challenged phonetically, there is also little semantic connection to an invitation for a clash on a carpet or in court. In the absence of the IE etymology, the Fr. chalonge directly ascends to the Türkic čal-/čalïš, with some local Gaulic articulation. The preserved auslaut suffix -ïš, reflected in the French -onge and English -enge, in the original substrate language could have had an allophonic form -ïch/-ïj/-ïg/-ïk/-ïkh etc., it is a suffix of reciprocity, transmitting the sense of “us, them” (go, fight, eat, etc.). A related allophone čalaŋ (chalang) “babbler, chatterbox” with a negation suffix -aŋ (see –un) is opposite of challenge for a nonchalant “unruffled, oblivious, d ismissive”. Its suggested IE etymology fr. Lat. calere “hot” and Fr. chaloir “care” is most dubious. See calumniate, chalant, –un.

English champ (v.) “chew noisily” ~ Türkic čap- (chap-) (v.) “noisy action”, including “chomp, chew noisily”. The origin of the Türkic čap-, with attested forms čàð-, čat-, čïb-, čub- and probably more, comes not from chewing, but from a whipping, “whip, lash, click”. In an auditory sense that means “chat, click”. The verb champ corresponds to the original Türkic semantics of chatting and clicking associated with a whipping sound to control herds. Cognates: Balto-Slavic chav- (chavkat, ÷àâêàòü) “chew noisily” (b/p/v ~ m alternation). The IE etymology is “probably echoic”, it is likely true. But an absence of the champing echo in the IE languages points to a specifically Türkic echo: Romance mordendo, ronge, Gmn. kauen, etc. The kauen is a form of the Türkic kev-, kevsa- “chew” (v.) preserved with somewhat derisive meaning, nowadays supplanted by regular beißen. The Balto-Slavic form, active in the Sl. languages, descended from the Türkic verb likely brought over to the British Isles with the Anglo-Saxon speech. No connection to champagne.

English char (v.) “burned, turned to charcoal”, (n.) “charred” ~ Türkic öčür- (v.) “quell, extinguish (fire)”. Ultimately a causative form of öč- “quelling (a fire)”, semantically connected with o:t (o:d) “fire, anger (metaph.)”. Literally, a compound charcoal with elided anlaut ö- is an allophone of the compound öčür kül “quenched ashes, cinders, coals”. The importance of the charcoal for the incipient metallurgy can't be overestimated, entire forest tribes were engaged in production and supply of the charcoal to the metal industry. Cognates: A.-Sax. cer (/č-/), cerr (/č-/), ceorr (/č-/), cierr (/č-/) “churn, turn, change”, later charren “turn (to coal)”; OFr. charbon “charcoal”; Lat. carbo “charcoal”; Mong. ečüd, ečül “douse, quench, end, quell”. The A.-Sax. word was a stack of two homophones from two independent sources, one a Türkic “churn, turn, change”, the other a native “affair, business” unrelated to the notion “char”. The unaccented initial ö- elided at the time of internalization. The A.-Sax. c- was articulated as either /č-/ or /k/. Eng. has preserved the quality /č-/ of the initial consonant. No IE cognates, attesting to a non-IE origin. Very peculiar etymology allows to trace the word to its early ethnic origins. Char is a member of a four-word fire-related paradigm transferred from Türkic languages: öčür- “char”, öč “ash”, kül “coal”, “ashes, cinders”, kandil “candle”. A conservative probability estimate for the case of these four words (15 phonemes) accidentally appearing in two independent languages is 1 chance out of 1025, one trillionth of 10 trillionth, a chance infinitesimally small, for details see ash. Char is a member of a huge cluster of the European words starting with char-/kar- and dealing with burning: carbon, cremation, etc. In a perverted sequence, the IE etymology ascends char to charcoal, failing to come up with a rational etymology. It ascends char to an unattested faux “PIE proto-word” *ker- “heat, fire”, demonstrating complete misunderstanding of the linguistic phylogeny, with a circular logics based on false preconceptions. In the process, it unwittingly “reconstructs” an allophone of the original attested form of the Türkic öčür-. The importance of the word carbon for the Eurasian languages can't be overestimated. Notably, in addition to the amalgamated or adopted form of the word carbon, nearly every Eurasian language has retained in continued use its own synonym. The adopted word is used as a wad of concrete nouns and verbs in the cultural borrowing portion of the lexicons. The words carbon and cremation are now truly international words. See ash, candle, coal.

English chat (v., n.) “small talk, gossip” ~ Türkic čat (v.) “make thud (sound), noise”, any resemblance to cracking sound (chatter, chirk, chirp, chirrup, twitter, etc), derivatives  čatla- (čatu:la:-, čatı:la:-, satu:la:-, šatu:la:-) (v.) “chatter, talk non-stop, chirp”. The Türkic notion of “buzz, small talk” and the like can be expressed singly, reduplicated, or in combination of elements čat, čet, čıt, čit, see chitchat. The alternation s/sh/ch is a regular dialectal event in Türkic languages, the s-/š- <-> ch- transition likely occurred within Türkic vernaculars. Another prominent expression of čat is contained in the term for “cat” četük that formed the European, Asian, and African term cat with its many allophones, see cat. The root čit/chit is likely unrelated to the word chit “improper girl”, or to the word chattel “private movable property”, a derivative fr. the root čat- “gather, assemble” (see chattel), but formed the notion cheat “practice trickery or fraud”, that is “talk somebody into doing something (for their own disadvantage)”, and chatter “din, twitter, gossip”. No IE etymology, practically no presence in the IE family, thus obviously of non-IE origin. Cognates exclusively Gmc.: A.-Sax. none recorded, Du. koeteren “jabber”, Dan. kvidre “twitter, chirp” point to an original anlaut consonant that could develop into affricate or plosive. The č-/k- alternation parallels that of the form četük/cat. A primeval connection with the chirping of the birds and cat miaul have endured for the birds, attesting to an initial echoic origin. The words “chat” and “chitchat” entered Eng. as a verb and a noun, without any phonetic or semantic changes. Further derivatives arose as innovations in the course of Eng. evolution. They are paradigmatic transfer cases along with “say”, “tell”, “tale”, “saga”, “message”, and other words related to verbal communications. Each word separately, and the systemic nature of the paradigmatic transfer indisputably attest to the Türkic linguistic substrate. See cat, chattel, chitchat.

English chatter (v., n.) “gabble” ~ Türkic čatu:la:- (v.) “chatter”, fr. čatu (n.) “some sound, noise” or directly fr. čat (n.) “thud (sound), noise”, suffix -ur forms participle čatu:r corresponding to the Eng. form chatter. The verb čatu:la:- is attested in three allophones: satu:la:-, šatu:la:-, and čatu:la:-; the ethnic provenance of the form čatu:la:- is not clear, but probably follows the distribution of -š/-č alternation. The origin of the čat is obviously onomatopoetic, lit. “chatting, chattering, noisy”, its synonym is an alternate base name for a “cat” muš, from the cat's miaul, see cat, mouse, chat, and chitchat. No IE etymology, practically no presence in the IE family, thus obviously of non-IE origin. Cognates: A.-Sax. none recorded, OE (13th c.) cheateren, chiteren, chateren, Du. koeteren “jabber”, Dan. kvidre “twitter, chirp”. The č-/k- alternation parallels that of the form četük/cat. A primeval connection with the chirping of the birds and cat miaul have endured for the birds, attesting to the initial echoic origin. The word “chatter” entered Eng. as a verb and a noun, without any phonetic or semantic changes, further derivatives arose as innovations in the course of Eng. evolution; they are members of the paradigm transfer along with “say”, “tell”, “tale”, “saga”, “message”, and other words related to verbal communications. The systemic nature of the paradigmatic transfer indisputably attest to the Türkic linguistic substrate. See cat, chat, chitchat, chattel, mouse.

English check (v., n.) “examine, ascertain, mark” (Sw N/A, F396, 0.04%) ~ Türkic ček- (chek-) (v.) “separate, identify, mark with markers (dots etc.)”. A notion is eidetic with idioms “dot the i's” and “check the boxes”. Per G. Clauson, the word numbers some 30 meanings, a few of them are cited, with fuzzy semantics. The OTD offers semantics “pick (v.)” and “mark (v.)”. The verb carries a bifurcated notion connected with implied actions of notching and picking related to taking toll. It probably originated from counting on fingers, and then for marking similar objects one at a time. Literal and metaphorical derivatives, at times very remote, fit into bifurcated semantics. Concrete applications reflect realities of the times: “mark (with diacritics)”, “inspection (military)”, “pick (separate)”, “draw (gambling)” etc. Numerous Eng. derivatives are semantically connected with “secure, verify”, rather than “count”: bank check, hat check, checking account, hotel check in, checkup, rain check, double-check, spell check, checkout, checklist, checkpoint, paycheck, unchecked, checker, etc. They all are completely distinct from and inconsistent with a fancied etymology “threat, attack”. Their semantics is a world away from the gaming “threat, attack” innate to the chess terms “check” and “checkmate”. Cognates: None cited. Within the IE paradigm, the word is routinely confused with gaming semantics “threat, attack, block” of the homophonic check, checkmate related to OFr. eschequier “chessboard”, MLat. scaccarium “place for (chess peice) shah, king”. Like a related name for the game checkers, the Eng. homophonic chess term check comes from the Pers. shah “king”. For the connotation “secure, verify” neither an Arabic nor Pers. shah (i.e. check) mat “king is killed” make any sense. It is grossly anachronic (ca. 1300) against the indispensible in trade and commerce since most ancient times Türkic ček-. Thus, confidently no IE parallels whatsoever. The misleading etymology profanes the great place this rich and productive word occupies in the English language.

English chew, chaw (v.) “manducate, masticate, ruminate” ~ Türkic kev-/gev- “chew, chaw, ruminate”. Ultimately a denoun verbal derivative fr. a homophonic kaw “dry grass”, i.e. out-of-season horse food, “dry”, “dead wood, tree” . The word came to us from a dawn of producing enterprising. It left an unmatched-long trail of derivatives and allophones like ku-/kug-/quɣ-/êîó etc. The term was of a prime importance for people and their sustenance source. Managed herds of pre-domesticated wild horses survived on grass from under a cover of snow. The practice of grazing horses from under snow extended to the domestication time and well into the present. For a better survival of the herds they were driven to winter quarters in the warmer and greener lowlands at river deltas and lakes. For pastoralists, that was a practical solution, preservation of their herds without toil to fodder the animals. From the shades of the notion chew it appears that the word eat was primary and included the notion chew, and a notion chew was a later development connected with ruminants and rumination, complementing the initial notion. That conjecture is supported by the exceeding wealth of the nouns and verbs chaw, cud, quid that point to rumination rather than to human chewing. That conjecture is also supported by the Balto-Sl. and Pers. examples with forms for chaw connected with related jaw. The word kev-/gev- has survived into the Eng. cow and Gmn. kaff. Via various paths, the Türkic notion of “chew” came to Eng. as a part of a wide paradigm of the food consumption: eat, cheek, chew, jaw, manducate. The k-/ch- transition is peculiar to Eng. and probably originated still in the Türkic milieu. Cognates: A.-Sax. ceow(an) (with ch-) “bite, gnaw, chew” (unlike the A.-Sax. ligature cs- unambiguously denoting phoneme sh, the A.-Sax. anlaut c- depicts the phoneme ch without marking a modifier), cow (with ch-) “chaw, food”, cu, cu(e), cy, cus, cue, cu(n)a, cyna; cum “cow”; cubyre, cucealf, cueage, cuhorn, cuhyrde, cumeoluc, metecu “cow-shed, cowcalf, cow eye, cow horn cowherd, cow milk, met cow” respectively; MDu., Du. kaf, kauw(en), OHG hewi “hay”, kiuw(an), MLG keuw(en), Gmn. kau(en), Kaff “chew”; Goth. hawi “hay”; Balto-Sl. (Lith.) žiau(nos) “jaws”, (OCS) živo “chew”, Rus. kovyl (êîâûëü) “grass (type)”; Pers. jav(idan) “chew”; Mong. qag “dry”, qawdan “dry grass”; Khalkha hagd “dry grass”. An unacknowledged cognate of the chew is cavalry, lit. a derivative of “(riding) a ruminate, chewer” with its trail of cognates.. A truncated form qu- is used in A.-Sax., Gmn., and in Kipchak and Altaian, a version hu- in Khakass (Sevortyan 1997 vol. 5-1 p. 169). The A.-Sax. forms attest multiple articulations of the anlaut c-, lining them up with the Gmc. forms as opposed to the Balto-Sl./Pers. forms. The form champ with b-/w-/m- alternation attests to connection with m- vernaculars. The line-up of “IE” cognates appears to confuse, in case of Balto-Sl. and Pers., the allophones of the word for jaw- Türkic čügtä, čügde:, čökdä “jaw” with the allophones of the Türkic kev-/gev- “chew, ruminate”, creating a false correspondence and consequently a spurious assertion for k-/ž-/j- alternation. The forms for the word eat (Türkic ye) are shared by Türkic and a number of IE languages, none of them include the notion chew, while the notion chew in the above languages was derived from two distinct concepts, one “ruminate”, and the other “jaw”, attesting to parallel and independent processes. The segregation between chew/chaw/chow and cud/quid appears to be artificial because their semantics is identical, and the forms eidetic, given the fluidity of the second consonant or semi-consonant like -y. The -y is frequently a second form for a number of weakly articulated consonants. The allophonic cognates for the form cud are A.-Sax. cudu and cwudu, where ligature cw- stands for q- or dg-; ONorse kvaða “resin”, OHG quiti “glue”, Gmn. kitt “putty”. Besides once again confined to the Gmc. branch, these semantic derivatives of the original kev-/gev- again find a parallel in the Sl. word žvachka (æâà÷êà) “cud, chewed mass”, a derivative of a Balto-Sl. žiau/živo. The IE etymology came up with few invented “proto-words” like *gyeu-, mimicking Gmc. roots and unwittingly inventing non-existing allophones of the real Türkic roots. All those oddballs are demonstratively unrelated to the bulk of the IE vernaculars or the IE family, these “reconstructions” are quite flagrant. A “restored Proto-Germanic” form *kaf- “chew” is identical to the attested Türkic kev-/gev- “chew”. See eat, cavalry, champ, cheek, jaw, menu.

Supplementary Note. Chance paradigm

Some statistics for an 1-word paradigm, a chance coincidence of a PG *kaf- “chew” and a Türkic kev-/gev- “chew”, and for a 5-word paradigm for food consumption: eat, cheek, chew, jaw, manducate.

Semantics: In a 10,000-word basic dictionary, the terms connected with the physical process of food consumption would not exceed 100. In actuality, that number would be closer to 20. The number 100 is extremely conservative. Next, in the food consumption semantic field of 100, the subset for physical oral grinding “chew” would not exceed 20. In actuality, that number would be closer to 5, thus the number 20 is extremely conservative.

Phonetics: A 50-phoneme language can be broken into 10 phonemic categories, with average 5 phonemes in a category. In actuality, most categories would be closer to 2 phonemes, thus the number 5 is quite conservative. In case of *kaf-, the forms gaf, haf, kaf, qaf; gef, hef, kef, qef,  etc., would all be deemed eidetic to *kaf-.

The accidental semantic match for 1-word paradigm “chew”in any language is Psem = 20/10,000 = 0.002.

The accidental phonetic match for a 3-phoneme *kaf- in any language is Pph = 1/103 = 0.001. Thus a 10,000-word basic dictionary would produce 10 statistical matches, ensuring that any 10,000-word language would have at least one match.

The simultaneous accidental semantic and phonetic match for “chew” and *kaf- in any language is P1 = Psem X Pph = 0.002 X 0.001 = 0.000002. In other words, it would take fifty (50) 10,000-word dictionaries to produce a single statistical match. The world's 6000 languages would produce 6000/50 = 120 languages satisfying the requirement. Not a small chance. In actuality, that number would be closer to 12, correcting for excessive conservatism of the premises. Still, statistically there is a chance of an accidental match.

For a 5-word paradigm for 3-phoneme word roots of the words eat, cheek, chew, jaw, manducate, accidental match probability P5 = (P1)5 = 0.0000025 = 0.000,000,000,000,000,000,000,000,000,032. It is an absolute impossibility, and it is an enormously conservative impossibility. Be it 10,000 times smaller, it would add no substance: beyond a threshold of possibility, 10,000 impossibilities equal the same 1 impossibility.

The verdict must be: The English eat, cheek, chew, jaw, manducate, and the Türkic ye-, čaak, kev-, čügtä, meŋ constitute a case of paradigmatic transfer. They are genetically connected and must have had originated from a common linguistic phylum.

English chill (v., n., adj.) “coldness, cold” ~ Türkic čil (chil), syil, ye:l (v., n., adj.) “wind, windy”, Uz. čilə (chile) “cool”. The forms čil “wind” and xaltarä “to freeze” are cited for Chuv. language, they are a sample of anlaut cornucopia č-/s-/š-/y-/j-/dj-/z-/ž-/d-/Ø.  The spread of phonetic forms and undifferentiated grammatical use attest to a most primeval origin of the word. A CT is y-type, č-type is second in popularity (including Khakass), s-type shows up in the Far East. In CT čil is vocalized ye:l with semi-consonant, and with “usual phonetic changes” and few vowel variations. Individual languages have their own semantic extensions. The chill and cold are allophones, with alternation of the initial k-/ch-/h-/x-/g-, and probably more. The form xaltarä in other Türkic languages echoes a form katur- “freeze”, fr. the verb ka:d- “freeze” and noun ka:d “snow-storm, blizzard”. The underlying notion of the stem ka:d- is “harden”, hence the metaphoric “freeze”. Alternatively, xaltarä may be a derivative of ka:r “snow” (like in Kar Sea “Snowy Sea”) with -r-/-l- alternation typical for the Chuv. language. All three roots, “wind/blow”, “freeze”, and “snow” are phonetically and semantically connected with chill and cold. In warmer climates, the notion čil/ye:l “wind/blow” was associated with “coolness”, and in northern climates čil/ka:d- “freeze” and “snow” were associated with “cold” and “chill”. The sound of wind is a fourth meaning “scream” of čil/ye:l. In Eng., the common form ye:l has produced words “yell” and “howl”, paradigmatically transferring the meaning of the wind's distinct “howl”, see howl, yell. The mechanism of metaphoric semantic extension, especially in the northern area with continental climate like that of the Chuvashes and their neighbors, fr. “wind” to “howl” and to “chill, cold” is identical. The notion of the inflicted cold has been preserved in the A.-Sax. and OE semantics for chill “feel cold, grow cold”. Cognates: A.-Sax. ceald (with ch-, cheald) (n.) “coldness, cold”, (adj.) “cold, cool”, (adv.) cealde (with ch-, chealde) “coldness, cold”, ofcalan (with ch-, ofchalan) “chill down”, Goth. kalds (adj.) “cold”, OHG. (adj.) kalt “cold”, Du. hàl “frozen ground”; OCS (adj.) hladn- “cold”, prohlada “cool (place, air, etc.)”, (n.) hlad, hlod “coldness”, (v.) ohladit- “cool down”, minor variations across Sl. languages; Lat. (adj.) gelidus; Pers. caidan (< ka:d-) “chill”, catarrh (< ka:d-) “cold”; Skt. hladate “refresh”, prahladas “cooling down, enjoy”, the Skt. forms are barely distinct fr. the Sl. forms both in phonetics and semantics; Hu. szel (sel) “wind”; Dravidian cali “cold”; Mong. salki (ch- > s) “wind”. The IE etymology and IE etymologists are thoroughly confused on the issue of chill and cold, offering conflicting asterisked “PIE proto-words” to jam reality into ordained ideological chimera. The oldest word is Middle Eastern-native Dravidian, believed to precede Semitic and Aryan migrations. Distribution of the cognates is manifest, it reflects a Corded Ware-type pan-Eastern European vernacular that split and migrated westward, eastward, and southeastward, with a finger extending to the Apennines. The direction and timing of the split have been reliably traced, the Indo-Aryan migration ca. 2000 BC, the Apennine migration ca. 1500 BC, the Sarmat migration ca. 200 BC. Languages that carried diagnostic markers were not necessarily those named out in the titles of the periods, they could have been those of the displaced, not of the displacers, albeit displacers could again superimpose over the displaced, at another time and in the new location. Flashes of primary and secondary events are reflected in biology (genetics), archeology, and later in historical records. The IE confusion is precipitated by a second European line of non-cognate “cognates” derived from the Türkic stem for “frost, hoar” and, of all the words in the world, “pendant”. The primary word for “pendant” is a cognate of Eng. “sag”, something that is hanging, drooping, and swinging. It produced the name for icicles, precipitation, and also for “shivering” and “freezing” (see G. Clauson, EDT, 1972, p. 826, silk- “shake, shiver” and salkım “pendant (adj.), cold, hoar-frost”). Etymological conflation of two terms with their own etymological histories does a disservice by using a postulate as a proof of that postulate. Hence the confusing cognates: Lith. (adj.) šaltas “cold”, Hu. szel (sel), Osset. (n.) sald “cold”, Av. (adj.) sarǝta “cold”, reflexes of the Türkic salkım, šarkım “cold, hoar, frost, pendant (adj.)”, Yakut šel “wind”, siel “hair, mane”, CT yelpi “wave, flatter, swing”, yelke “nape” (hair on the nape side, like horse mane), etc. Descendent from a different base word, the sil- line became synonymous with the ye:l line, spreading in parallel with it and, as is the case with Skt. vs. Av. languages, at times as demographic components of the same migratory events. The bulk of the population, naturally, remained behind, furnishing the markers of who, when, and where, datable by common genetic alleles in addition to the older clues. The NW European forms of the yel line demonstrate, first, the volatile nature of the anlaut consonant ch-/k-/h-, and second, their allophonic nature. They are dialectal forms ascending to the stem ye:l grown in the linguistic ambience of consonantal anlaut dominance. In that, they are eidetic with the analogous phenomenon typical for the Oguric languages, Cf. the same process for the homophonic “hair, mane”: CT (Turkmen, Gagauz, Turk., Az., etc.) ye:l, Chuv. čil, čilxe, Tuv. čel, Khak. čil(in), Yak. šel, šiel, the notion “hair, mane” reflects a visual aspect of the wind action (Cf. Romance pelo “hair”, Eng. pelt, i.e. something flattering under wind). The Ogur-type forms, and especially the Khakass (Enisei Kirgiz) form are diagnostic. The Ogur-type forms corroborate a thesis that the Huns' and Scythian languages were of the Ogur-type. The Khak. form is consistent with the other forms of the Eng. Turkisms. The Chuv. form čilxe “mane” is suspiciously close to the A.-Sax. form feaxe “hair, mane”, differing in the quality of the anlaut consonant and elided -l-. Taken altogether, the Chuv. forms point to their autochthonous nature, their ancestors arrived to the Eastern Europe from the heart of Asia in the 7th-6th mill. BC, carrying their predominant R1a Y-DNA allele and their archaic R1a Y-DNA Türkic language. In the middle course of the river Itil (Volga) they initiated the Middle Volga, Samara and Khvalynsk Pit Grave (aka “Kurgan”) archaeological cultures, the historical-cultural complexes of 8-6 thousand years ago and later. That flies in the face of the Chuvash presumptive association with the Far Eastern origin, where Mongols, Tunguses, and Manchu, and their relatives Koreans and Japanese, are marked by the Y-DNA group C. DNA links Chuvashes with the R1a Y-DNA Huns. That links them with the R1a Y-DNA Russians, who carry more native Chuvash blood than anything else. The Chuvash-Mongol linguistic similarities ascend to 93 AD, when 100,000 Hunnic families numbering 500,000+ people submitted to the ruling minority of Syanbi (pin. Xianbei 鮮卑) Mongols, essentially making the Hunnic language a language of the Syanbi. From the Eastern Europe, the R1a people expanded to the Balkans and to the heart of Europe, amalgamating with the local farming populations. Upon the arrival of their violent mounted cousins marked by R1b Y-DNA haplogroup, and after a period of the “killing fields”, most of the western settlers retreated back to the Eastern Europe. The common origin of the NW European forms is consistent with the identical premise within the IE etymology. At the same time, etymology does not need a mechanically produced faux “PG proto-word” *kal-, since it has real attested prototypes ye:l and čil ascending to Dravidian cali. The IE etymology does not dig into the origin of the putative *kal-, explaining something known a little with something totally unknown. Like many other words, chill was lurking in Eng. surfacing again to an educated light relatively late (1400s), while apparently cold was a more common or enlightened term that left more documented traces. Related: cello (violoncello), holla, hollo, holloa, holler, yelp (with allophones) and yowl, notable for episodic prosthetic anlaut consonant ch-/h-. The multi-layered paradigmatic transfer of clusters of meanings, variations, and a rich store of derivatives provide indelible evidence for the common genetic origin. See cold, howl, yell.

English chirp (v., n.) “high-pitched sounds” ~ Türkic čïlra, (v., n.) “jingle, clink, ping, ring”. Cognates: A.-Sax. cearcian (with ch-, chear-) “to creak, gnash”, ME chirken “chirp”; Sl. chirikat (÷èðèêàòü); Spanish chirrido, Rum. ciripit; Est. siristama; but Gmc. Dan. pippe, Du. piepen, Sw. kvittra, Norse kvitre, Gmn. zwitschern; other European people have 12+ different roots for the word. That shows inadequacy of etymological verdict “of echoic origin”. Different “chirps” produce different echoes, in case of the modest European peninsula 15+ different “chirps”. A maze of echoic origins should have the same trail of origins as any other etymologically scrutinized lexeme to be credible. Distribution of individual echoes should have the same scrutiny as any other etymologically scrutinized lexeme. In the case of the Eng. “chirp”, the distribution is very telling: “chirp” is very mobile and easily crosses linguistic barriers, it appears spottily in different European branches, has a spotty appearance in the NW Europe, spotty in the Slavic branch, and extends eastward along the Steppe Belt to the Far East. These pointers unequivocally point to the Türkic milieu as an only viable source for the initially echoic common origin. In contrast, presently there is no etymology other than the superficial “echoic”.

English chisel (v., n.) “carve” ~ Türkic čiz- (chiz-) (v.) “chisel, draw (depict), write”. Ultimately fr. one of two phonetically diverging basic forms in two geographical zones: čiz-/čïz-/-sız- and yaz-/jaz-. A third form is particular: ir-/i:r- “notch, scratch”. Instrumental suffix -l is of European provenance, Cf. A.-Sax. cisel “gravel”, cecel “cake”. Cognates: A.-Sax. none recorded, OFr. cisel “chisel, scissors, shears”; Croat chr- (crtanje), Serb. tsr- (öðòàœå) “draw, draw lines”, Rus. cher- (÷åðòèòü, ÷åðòåæ, ÷åðòà) “draw, drawing, line”, Slov. kre- (kresli); Hu. ir- “write” (s/r alternation). Practically no presence in the IE family, obviously of non-IE origin, any IE etymology is figmentary. The standing IE etymology stops at Lat., under a dubious assertion of origin fr. a “PIE proto-verb” *kae-id- “to strike”. The notion “carve” predates Lat. and Eng. by a long shot, the word came to Eng., and by the same token to Sl. (but not Balt.-Sl.) via independent paths. OFr. likely gained the word fr. the Türkic milieu, either Alanic or Burgund. As indicated by the consistent initial ch-, the word came as a legacy. Like other survivors in Eng., it was a trade term below a radar of the grammarians. The stem čiz- “draw” has European siblings ris-/riz- and Skt. rikh- “carve, scratch, draw” (derivative rizan, Turkish resim) that left a lasting trace in Gmc. and Sl. languages for the notions “draw” and “write”. The Skt. inlaut -kh- (-h-) is consistent with the s/h alternation typical for the Aral basin vernaculars, Cf. Lat. ignis vs. Skt. agnih “ignite”. The perfect phonetic and semantic match between the Türkic čiz- and Eng. chisel, an absence of European presence, and identical semantic developments to “draw (depict)” and then to “write” leave no room to doubt a common genetic origin from the Türkic phylum. See write.

English chitchat (aka chit-chat, chit chat) (v., n.) “gab, schmooze, buzz, hum” ~ Türkic čit čat (v.) “make thud (sound), noise”. The Türkic notion of “buzz, small talk” and the like can be expressed singly, reduplicated, or in combination of the elements čat, čet, čıt, čit, the form čit čat ~ chitchat is but one of them, synonymous with chat “talk idly, babble”, see chat, chatter. Another prominent expression of čat is contained in the term for “cat” četük that formed the European, Asian, and African term cat with its many allophones, see cat. The root čit/chit is likely unrelated to the word chit “improper girl”, or to the word chattel “private movable property”, a derivative fr. the root čat- “gather, assemble” (see chattel). It formed a notion cheat “practice trickery or fraud”, that is “talk somebody into doing something (for their own disadvantage)”, and a chatter “din, twitter, gossip”. No IE etymology, practically no presence in the IE family, thus obviously of non-IE origin. Cognates: A.-Sax. none recorded, Du. koeteren “jabber”, Dan. kvidre “twitter, chirp”. The č-/k- alternation parallels that of the form četük/cat. A primeval connection with the chirping of the birds and cat miaul have endured for the birds, attesting to an initial echoic origin. The words “chat” and “chitchat” entered Eng. as a verb and a noun, without any phonetic or semantic changes, further derivatives arose as innovations in the course of Eng. evolution. They are members of the paradigmatic transfer series along with “say”, “tell”, “tale”, “saga”, “message”, and other words related to verbal communications. The compound chitchat is also a paradigm in its own right. Each word separately, and the systemic nature of the paradigmatic transfer indisputably attest to the Türkic linguistic substrate. A calculation similar to that for “chew” q.v. for a chance coincidence for the whole paradigm would produce a number thousands times smaller than the probability value P assessed for the “chew” paradigm, a positive impossibility. See cat, chat, chatter, chattel.

English chop (v.) “cut into pieces”, (n.) “(lamb) chop” ~ Türkic čöp (chöp) (n.) “piece (of meat)”, čap- (v.) “chop, strike (behead, with sword, whip)”. The Türkic noun-verb primacy is transposed in Eng. For a verbal stem, the OTD and EDT list čop as čap-, but G. Clauson observes that a surviving causative denoun verb čobart- “strip” is a product of the verbal stem čöb-/čöp- (Clauson, p. 398). The same is applicable to the passive verbal form čobul- “split, part”. Whether the primary was a noun čöp “piece” or a verb čop- “chop, strip, part apart” is untenable. The surviving OTD and EDT attestations of the area and period sufficiently corroborate G. Clauson's presumption of the verb's usage outside of the purview of the sources: čö:b (čö:p) “residue”, čöbik (šöbik), čöpür “debris”, čopra: “rubbish”, , , čobulmak “apple piece”, čaput/čapğut “quilted”, čaptur- “strike, ruffle”, čapıl- “striken”, etc. In the G. Clauson's opinion, -b- was a primary articulation, and -p- was a secondary. That belief too is untenable; the -p- form may help with diagnostic evidence. Cognates: MDu. kappen, Dan. kappe, Sw. kappa “to chop, cut”, ONFr. choper, OFr. coper, Fr. couper “to cut, cut off”. No IE links whatsoever, the IE etymology does not reach even the Lat. The three somewhat different phonetic forms indicate three different paths to English, Germanic, and Romance groups respectively, and also may help with diagnostic evidence. In Europe, this word is associated with the area populated by R1b Y-DNA-marked haplogroup, in the Eurasia this word is associated with the Türkic people marked with varied mixtures of the R1b and R1b Y-DNA haplogroups. The English chap, chapp “crack, split, burst open” appears to be an allophone of the Türkic verbal stem čop-. The čöp as “leftover pieces, leftovers, rubbish, sediment” also produced the Eng. chaff, probably via a separate path of a dialectal form čö:b. The English slightly denigratory chap “boy or man” is a reflex of the Türkic idiom čöb (kiši:le:r) “(human) rubbish”. The English chop (“chopstick”, lamb chop, etc.) is a reflex of the Türkic form čobul- “split”, Cf. čobulmak “slice (of apple )”. See chaff.

English circle (v., n.) “ring, encircle” ~ Türkic sürkülä- (sürkïla-, sürgü:le:-) (v.) “pursuit” fr. sür- (v.) “lead, drive, pursuit”. Phonetic allophony is striking. The terms “drive, pursuit” are terms of traditional Türkic encircling hunting methods, they provide both phonetic and semantic unity. Türkic has numerous terms and derivatives related to encircling hunts, attesting to the ancient (hunter-gatherer society) origin and geographical spread of the terminology: abla-, avla-, er-, ir-, qačrus, qapsa, qov-, saɣïr, sür-, sürkïla-, sürkülä, sürus, and probably few more. Cognates: Lat. circulus “small ring”, Gk. kirkos “ring”. Distribution attests to a non-IE origin. No IE predecessors. IE etymology is non-existent. A speculated IE derivation of “circle” from “circus” is unattainable. Another hunting word qapsa “encircle, surround, surround on all sides” is etymologically connected with the English “capture”. See capture.

English clinch (v.) “clench” ~ Türkic qïlinč (v.) “tie, link, brace, girdle, clinch”, from a stem qïlïn- “to come about, arise”: be made, formed, appear, arise. Ultimately fr. qïl-“do, make, act, copulate”. The semantics “clinch” is synonymous with the metaphorical semantics of “copulate”: have connection, do it, have a go at it, get it on, and the like. The English word fell from a blue sky in 1560s, with no IE or any other origins. The intellectual cultural influence of the Ottomans or Mongol-Tatars can be sensibly excluded. The word most likely survived in its pristine form in the context of wrestling matches, an eternal Türkic tradition along with the game of polo.

English coach (v.) “drive, ride a coach”, coach (n.) “carriage” (Sw N/A, F1934, 0.00%) ~ Türkic köč- (köch-) (v.) “ride a coach”, coach (n.) “carriage, wagon”. Ultimately “coach, migrate” (v.) fr. a base Türkic root qo-/qon- with notion “descend, stop, settle”, see come, go. The word is neutral to transitivity and noun/verb function, attesting to its utter archaism. It is a member of a cluster gi:r-/ki:r- “enter”, git-/kit- “leave”, göč-/köč- “move”, qo:n-/kön- “descend, stop, settle”, gö:q-/qoq- “go down, decrease, subside, fall off ” conveying different motion aspects relayed with means of phonetic synharmonism. The cluster points to a single consonantal phoneme root g-/ɣ-/k-/q- “go”, suffixed with a single consonantal phoneme č-/n-/q-/r-/t- to form grammatical aspects of the notion “move”. The single consonant word preceded by far an appearance of a productive pastoralism. Initially it had to be a word of a time span from a roaming to a hunter-gathering economy. It is an ancestor of the Eng. come, go, etc. Such evolutionary scenario calls for batches of cognate lexemes diffused across Eurasian linguistic families. In the nomadic society, a derivative coach must have been a most popular word that defined a daily life, with a wide spectrum of semantic forms and extensions. Its traces are found in most languages with a Türkic nomadic component or directly affected by their steppe neighbors. For millennia, coach was a pinnacle of progress, a transportation, a home, a homely hearth, a focus of life, and a way of life. Sedentary people do not have nor need a trade lexicon related to professional nomadism. They have to borrow it from the nomads, thus the abundance of the Türkic terminology across Eurasia. Cognates: Gmc.: Eng. coachman, Dan. kusk, Du. koetsier, Gmn. Kutscher, Norse, Sw. kusk koucher; MFr. (16th c.) coche, Gmn. kotsche; Balt. group: Latv., Lith. kučieris; Sl. group: Serb. koch (êî÷) “home”, Bosn., Croat kočijaš, Pol. kîñz “cart”, kîñzó “wagon”, Sl. Bulg. gjoč “family”, kochiyash (êî÷èÿø), Czech kočí, Ru., Serb., Slov., Slovt., Ukr. kucher (êó÷åð); Romance group: Cat. cotxer, Fr. cocher, Galician, Port. cocheiro, It. cocchiere, Sp. cochero; Hindi kochavaan; Pers. kuč “migration, striking camp, marching”; Fennic group: Est. kutsar, Fin. kuski, Hu. kocsi (~ Türkic köčür “transfer, move”); Basque kotxezainak; Brahmi köjür “to coach, move”; Mong. köčke “caravan”, köške “travel luggage”; Ch. ganche 赶车; except as noted all “driver, coachman”. Allophones of the form köch are known from Mesopotamia to the lake Balkhash and to the Far East. The depth of the attestations related to the carriage industry in the Eurasia, the distribution of cognates and a variety of phonetic forms attest to the antiquity and diversity of internalizing paths. A calque from the Türkic nomads with their mobile homes, Serb. has koch (êî÷) for their stationary homes. The Ch. form ganche ascends to the earliest known form kang “cart, wagon” recorded in Sum. The Ch. name Gaoche 高車 for the northern nomads is a compound köch + -chi meaning “coachmen”. The Ch. kangchi “coachman” was internalized in Ch. complete with the Türkic instrumental suffix -chi. The abundance of unacknowledged cognates in all diverse Eurasian linguistic families attests to grubbiness and manipulations within the IE linguistic cohort. The IE etymology “from a name of a Hu. village” is beyond laughable, it is a disgrace for the entire IE etymological industry. For a word that still occupies a major place in the European and Eurasian vocabularies, from transportation to home furnishings to tending to home itself, that IE-centered “etymology” is most uncouth, brazen, and dishonest. See Celt, come, gamut, go, hide, home, house.

English coagulate (v.) “turn from liquid to thickened or solid state” ~ Türkic qoyul- (v.) “thicken, inspissate (liquid), curdle, coagulate (of milk)”, a passive form of koy- “put, put down, abandon, give up”. The ultimate origin comes from the notion “squeeze, press, dry”, and is associated with preparation of curded milk products, from yogurt (curdled milk) to qurut (dried curd). The form cogul is an Ogur version of the Oguz qoyul. The English curd/curdle is a relict. Cognates: Gmn. gerinnen; Icl. hlaupi; Gael. gruth; MFr. coaguler, Lat. cogere “curdle, collect”, coagule, coagulare “cause to curdle”; Azeri čürü- (chürü-). No IE or any other etymology whatsoever. “Coagulate” became an international word in every European language, but many languages of different linguistic groups retained their own terms for “thicken, curdle”, pointing to the pre-historical usage of milk (Celtic, Fennic, Gk., Alb., etc). Türkic has allophonic forms that correspond to Lat. and Gmc. versions, qoyul- and čürü-. The semantic coincidence is perfect, the phonetic correlation is expected with liquid -r-/-l- alternation. The paths from qoyul-/qogul- to Lat. cogere, and the Eng./Gael./Gmn. from čürü-, to curd/curdle/gruth/gerinnen are separate in space and time. Like the Lat. form, the Icl. form hlaupi is connected with the Türkic qoyul-, but points to a separate path for historical reasons: sheep herding was a main Norman means of subsistence, they did not need a Lat. loanword to be borrowed, and the opportunities for such borrowing did not exist. See curd, cuddle.

English collect (v.) “gather together” ~ Türkic kölar (v.) “collect, accumulate”. The Türkic verb is associated with water accumulating in lakes and depressions. Its semantic extensions related to payment collection and collective enterprises like collector, collective, and collection are much later developments, and not on the Türkic soil. The Lat. collectus, colligere “gather together”, albeit quite old, already constituted innovations spread around with the Roman culture. No IE cognates. Excepting Lat. borrowing from the Ottoman Turks, this is an another Türkic word the Latins were endowed with during the Scythian times. The Lat. collatus “gather together”, that in Eng. developed into the verb collate “sort out, order”, is also a semantically conflated derivative of the Türkic kölar, and not as suggested an irregular form of the verb conferre “gather”, see confer. In English, collect may be just another Türkic word lurking in the local vernaculars that gained visibility under Roman influence. See confer.

English comb (v.) “groom (hair), ransack (area)”, (n.) “toothed tool” ~ Türkic kem-, kemdï-, kemür- (v.) “gnaw, bite”. Ultimate semantics is “to bite something, ruminate”. Within the Türkic milieu formed allophonic forms gev-, kev-, kep-, kip-, keg-, and a good half a dozen more. The -m- form must be a reflex of the -v-/-p- forms with m/b alternation. Cognates: A.-Sax. cemban, comb, cambo, OSax. camb, ONorse kambr, MDu. cam, Du. kam, OHG camb, Gmn. Kamm “comb”; Sl. kusat (êóñàòü) “to bite” fr. a Türkic allophone kuyshe- “to bite”, kushat (êóøàòü) “to eat” fr. a Türkic allophone kushe-; Agnean kam “tooth”, Kuchaean keme- “tooth”; Dagur kem- “chew”, Baoan kamel- “to bite”. The Celtic forms Ir. cior, Gael. cìr, and Welsh crib/grib “comb” echo the Sl. forms, with r/s alternation typical for the Ogur/Oguz distinction. The uniformity of the Celtic forms bespeaks of the common origin from a point in the 6th-5th mill. BC E. Europe prior to the Celtic anabasis. The relicts kempt, cam, and the modern camshaft and their siblings are the modern versions of the cognates. The silent auslaut -b apparently is a handy discriminator from an acoustically different version of the final -m-. The peculiarity of the -m- form vs. a slew of other versions bears a certain diagnostic value. The vowel is notoriously fluid. Distribution across linguistic families, across the entire width of the Eurasia, and exclusively within the Gmc. and Sl. groups in Europe excludes any IE uncouth speculations of the type like the “PIE” *gombhos with a “proto-word” *gembh- “tooth, nail”. Instead of unwittedly ornamented model, the IE etymology should have cited the direct, attested, and documented Türkic form gem- (v.) “gnaw, bite”.

English come (v.) “arrive, reach destination, happen” (Sw N/A, F67, 0.38%) ~ Türkic qon-, ko:n- (v.) “reach destination, stop”, Ultimately fr. a polysemantic verbal root qo- “come to a destination”, “leave (something), put, put down, abandon, give up”, “all, together”, expressed in a wide range of concrete and metaphoric applications with a rich trail in the European languages. The basic notions refer to the separate aspects of reaching a destination: coming to a stop, staging a stop, reuniting at a stop. The basic form qon- has an allophone qam- “put down, knock down, lower” with the meanings of the sister form qod- “put down”, i.e. “settle for now”. It has survived and blossomed into numerous derivatives, ending with the Eng. gamut, gamma and come, cum, common, commons, commodity etc., see gamut. The split points to a relatively late differentiation of the form qam- into the specialized areal “come” and “put down”. The semantic raster extends further out, Cf. Kelt/Celt “newcomer”, camp “stopover” company, campaign “gathering”. The stem qon-/qam- is a deverbal verbal derivative, it carries a long trail of verbal and nominal derivatives rooted in the specific notion of reaching a stopping point destination. The word and its first derivatives must have arisen during an agelong hunter-gatherer period, long before the coming of a pastoral period. The form qon- must have formed before the other derivative verbs, qop- “all, together” and qod-/qam- “put down”, which are the attributes of the base notion “come to a stopover” and its deverbal noun derivative “camp, stopover”. Cognates: A.-Sax. coman, cuman “come, approach, land”, Eng. cum “experience orgasm”, and the above gamut, common, commons, commodity, gamma, etc., OSax. cuman, OFris. kuma, ONorse koma, Goth. qiman, MDu. comen, Du. komen, OHG queman, Gmn. kommen, Icl. kom; Lith gemu “be born”; Sl. konets, konec (êîíåö) “end point” in a number of Sl. languages and allophones across Central and Eastern Europe. No credible IE common word, no European common word, the Lat. veni is but one of the many etymologically unconnected words in the European lexical spectrum. An alternate model, a hypothetical PIE fantasy *gwa-/*gwem- in lieu of the attested qon-/qam- is not needed. It erroneously reverts to the attested unrelated Türkic verb git-/kit-/ket- (ked, keδ-, kej-) “go”, with extensions “leave, depart, pass, disappear, die, refrain”. The two Gmc. forms attest to amalgamation of -o- and -u- dialects, still present among Türkic languages. The Goth. qiman and Lith gemu illustrate a general fluidity of the vowels. The -m-/-n- alternation is a regular effect of amalgamation with local vernaculars, whether of Türkic or Gmc. variety. The persistence of the specific -m- form among the Gmc. languages points to Gmc. initial linguistic uniformity and thus a compact spread. Aside from the come, European languages retained the form qod- “put down” that developed into the now international words code, codex, and their -i- versions codify, codicil, coding, codicology, etc. The massive presence of the of the Türkic derivatives in Eng. and other European languages attests to numerous parallel cases of paradigmatic transfers of the semantic allotropes ascending to the verb qo-. Of those, a paradigmatic transfer of the derivative qon-/qam- is a particular case. They incontestably demonstrate a genetic origin from the Türkic milieu. See Celt, gamut, go.

English con (v.) “steal by deceit, swindle”, conman (n.) “swindler” ~ Türkic kun-, qun-, xun- (v.) “snatch, steal, rob, carry off”, with literal and metaphoric applications. Semantically and phonetically it is close to Eng. cunning “shrewd deceptiveness”, they did not leave an A.-Sax. or IE trail. Cognates: A.-Sax. none, Gmc. none, IE none. On purely phonological differences, with quite ingenuous artistry, IE etymology ascends the siblings con and cunning “skillful in deception” to different roots. Those roots carry incompatible semantics and a temporal distance (“lateral time”) in millennia: “confidence” vs “to know”. In the first case it is exploiting a stripped prefix con- “with” (intensifier), i.e. “with (fidelity)”, OFr. conceveir), Lat. con(cipere), con(ceptus) “take, hold”. The conjecture is laughable, it portrays Türkic qon-“with” as an IE com-/con-/co-“with”, and Türkic kap-/qap- “hold, capture” as an IE faux *kap- “grasp”, see com-, keep. In the second case it portrays Türkic  köni- “verity, truth” as an IE fictitious *gno- “know”, see know. Every suggestion is an unwitting flop. The phonological difference between -u- and -o- should not be overrated, in Türkic -u-/-o- interchanges are routine, albeit they could signify two different paths. Phonetic and semantic match is as close as they come. The accent shift from generic to a stealth mode is a minor modification since the semantics of “snatch” carries an implied shade of “deceit”. A derivative konurčuk for a “doll”, lit. “pretender (toy)”, illustrates an application bridging the notions of “deceive” and “purloin (by deceit)”. The absence of recorded attestations for con and cunning in the Gmc. languages is consistent with numerous other lexemes that came to the linguistic attention on another continent, no more tamed by the “correct” English, Cf. boss, OK. A possible storage closet for these words was cockney, the Türkic köken “motherland, native place, ancestral land” that referred to the motherland's language, or what was left of it. A.-Sax. had 9 words expressing a notion “deceit” (bepaecung, beswic, dwild, fleard, folcwoh, forleornung, leasbrednes, listwrenc, lotwrenc), of which just one had survived to English (swindle, A.-Sax. swice). The word con is an another survival, possibly, but not necessarily, via A.-Sax. path. See boss, cockney, com-, cunning, keep, OK.

English confer (v.) “bestow” ~ Türkic ber- (v.) “carry”. See bear (v.) for polysemantic meanings and cognates of ber- . Cognates: MFr. (con)ferer “give”, with extensions “converse, compare”, Lat. (con)ferre “bring together”, with extensions “compare; consult, deliberate, talk over”, fr. Lat. prefix com- “together” + ferre, an allophone of Türkic ber- “bear, carry, give”. The Lat. prefix com- of Türkic origin (see com-) came to Eng. with Lat. loanwords (confer, conference, conferee, etc.) unrelated to English substrate, while indelibly illustrating the Turkisms' Lat. path to English. Related conference. See bear (v.), bestow, com-, give.

English crunch (v., n.) “crackle” ~ Türkic qurt-, kürt- (v.) “produce crunching sound, crunch”, karč, qarč, qurč, qars, qars (n., v.) “crunch, crackle”. Etymology: no original source claimed, “probably of imitative origin”. Supposedly no cognates, but... Balto-Sl.: Balt. (Latv.) skraustet “squeak, rattle”, Balt. (Lith.) skrudeti “crack, flake”, (Sl., OCS) hrst, hrust (õðúñò, õðóñò) “crunch”, “scrunch”, “crackling bug”, (Pol.) chrustač́ “crunch”; Gr. vrouchos (βροῦχος) “crackling bug”. Probably it is truly of imitative origin, and the Türkic qurt (ɣurt with glottal ɣ-) and similar forms suggests the source of the imitation, further supported by numerous Balto-Sl. reflexes. The claim “of imitative origin” is quite peculiar in that this echoic crunch is contained only to NW Europe and the Eurasian steppe belt. The -č/-nč/-ch/-nch is a Türkic denoun suffix forming synonymous nouns, it has survived intact in English crunch turning into an integral part of the root, and under various allophonic morphemes is visible in Balto-Sl. and Gk. forms. The appearance of prosthetic anlaut s- in some forms is consistent with process of adaptation of other loanwords into Baltic family. Of the European forms, the Pol. form chrasc (xrasch) is closest to the English crunch, probably it was an Anglo-Saxon form before it became a Pol. form. Semantic extension to anything that is conspicuous by a scrunching sound is an ordinary process congenital to all languages, in Sl. such extension formed a word for cartilage, the Eng. came up with crush, etc. The late record (17th c.) attests that like many other Turkisms, it was a “folk speech” lurking beneath the literary language.

English cry (v., n.) “shout, sudden loud utterance” (Sw N/A, F1438 0.01%) ~ Türkic qarqïla (v.) “shout, cry (utterance)”. Ultimately fr. echoic raucous call “qarr” of crow, -ï- prosthetic junction, -la verbal denoun suffix. Alternatively čarla- “shout, cry”. Alternatively orla-, orlaš-, urla-“cry”. Alternatively qïqïr- fr. qaqïla- with a silent second consonant, ultimately fr. kö:k related to sky, a basic notion of “thunder” (G. Clauson's suggestion). Etymologically attributed to different origins, all these forms are uniquely consistent, suggesting a long prehistory and plenty of mutual influence that ruffles any notion of a single etymological source for each one. Cognates: A.-Sax. ceir (with k-), OSax. hragrà, Goth. hrukjan “to crow”; OIcl. skrikià, Gael. gairm “a cry”, theoretically one of candidates for Eng. “cry”, Cimr. ñróñh; Romance group: OFr. crier, VLat. quiritare “wail, shriek”, It. gridare, OSp. cridar, Sp., Port. gritar; Balt. (Lith.) kryksti, Balt. (Latv.) krikà, Sl. krik (êðèê), and kukariku (êóêàðåêó) “cock-a-doodle-doo, rooster's cry”; Gr. krike (κρικε); Ar. qara’a “aloud” (Cf. Quran), Heb. gaar “shout”, kara “cry”, Ugar. qr' “call, invoke”; Akkad. karu “call, invite”; Mong. khertsgii “to crow”. The OSax., Goth., OIcl., Cimr., Lith., Latv., Sl., Gr., Mong. forms ≡ qarqï. The Eng. battle cry in A.-Sax. sounds beadu ceir and in Tr. something like batten qarqï, with second -q-, if any, assimilated to palatal or elided. Distribution covers Eurasia from Atlantic to Pacific. It traces the spread of the Celtic Kurgan people from Iberia, their incursion to the Apennines, and an independent overland path from the steppe belt to the Baltic and on toward Albion, a distinct path of the Kurganians. It excludes the “PIE” faux *qer-/*qor-/*qr- as a viable vehicle for Eurasian distribution. In spite of abundance of cognates pointing in one direction, the IE etymology holds it “of uncertain origin”. The late Romance acquisition probably came with the Burgunds of Provence. The sense of weeping is a much later innovation, probably on religious grounds. The root qar- is specific to a certain group of Türkic languages - specifically mentioned are “Tatar”, Chuv., Tuvan. The term “Tatar” is a Russian moniker for a multi-lingual political entity, like “American” or “Canadian” “languages”, it is largely Kipchak and Bulgar. The OSax. and similar forms are distinct reflexes of the Türkic qïqïr- with its dialectal allophones. See call, gabble, say, tale, tell.

English cuddle (v.) “nuzzle, embrace for comfort, hug” ~ Türkic kod-, koy- (qod-, qoy-) (v.) “lay in embrace”, from kod/koy (qod/qoy) (n.) “breast, bosom, embrace, hug”. Türkic has dialectal allophones of kod (n.) in the forms qon, qoyïn, qoyun. The form with passive suffix -l that matches the Eng. form is kodul-/koyul- “placed, nuzzled”, with quite a few more meanings, one of which is a kind of “pacify” leading to “thicken”, coagulate, and curd. Turkish has two forms, apparently coming from two dialectal groups, koynuna and kucak (kujak), originating from the same root, all with a meaning “breast, bosom, embrace, hug”. Cognates: A.-Sax. cull, coll (v.) “to embrace” (recorded with elided -d- or -y-). No IE etymology, the best not too enlightening and not too insightful attempt suggests “at first a nursery word” without a clue on where and when was the nursery. On closer inspection, the English pra-mothers had Türkic bosoms and gave Türkic hugs to their nestlings. Cuddle is intimately connected with coagulate. See coagulate, curd.

English cull (v.) “pluck” ~ Türkic čul- (chul) (v.) “pull out, pluck”. Among allophonic forms yul-, yol-, etc., čul- is alone from Khakass (aka Enisei Kirgiz), paralleling other English Turkisms q. v. peculiar to Khakass. Semantically, it is overlapping with Eng “pull”. Cognates: OFr. cuiler “pluck, select, collect”, Lat. colligere “select, choose”, probably also related to Lat. coleus “strainer bag”. No inkling to IE origin, no hint on IE connection, instead go wild speculations based on phonetic consonance, “testicle”, “man”, “dupe”, etc. The IE speculations exploit various late derivatives with vague semantics. Notably, A.-Sax. has an overabundance of 12 stems to express “pluck (v.)” (alucan-, aplue-, gad-, luc-, nim-, pluc-, plyc-, pull-, traeg-, twic-, upateon-, utabre-), formed with words from numerous dissimilar languages, but it does not have the Eng. stem for cull-, attesting that the cull- lived beneath and independent form the strata of written records. In that, the cull-, like many other Eng. Turkisms, belongs to the lexicon that largely escaped literary language and suddenly appeared fairly late in the body of the Eng. vocabulary. The perfect phonetics and semantics attest to the allophonic nature of the Türkic and Eng. forms. The Eng. noun “reject, discard, cast away” is a colloquial extension used in some trade lingos.

English curdle (v.), curd (n.) “coagulate, coagulated liquid, milk” ~ Türkic kurt, qurt (n.) “curd”, “in some north-central, north-western languages” (EDT, p. 648). In other Türkic languages prevails an allophonic form kurııt. Both forms are derivatives of the verb kuri:- “to dry”, and ultimately fr. the verb kur- “fix, set in order”, widely used in physical and metaphorical contexts. A cured NY steak is dried, not healed. Alternatively, curd may be a verbal form of a noun ko:r “curd” with verbal suffix -t/-d that reverted to the noun use. The -l- in curdle is a verbal suffix of passive voice, “curdled”. Cognates: A.-Sax. crud, Gael. gruth (both closer to the form kurııt); no cognates in other Gmc. or Romance languages. The absence across IE family attests that the word is non-IE, and any assertions of a “reconstructed PIE” word are an overt sham. The IE etymology suggests an origin fr. “urge, coerce”, which is an unsuitable metaphoric and not physical expression, and uncouthly suggests a connection of the milky crud with “crowd”. An imaginative trip to a fantasyland substitutes for a sober analysis. A metathesis of A.-Sax. crud “press, drive” ascends to the verb kuri:-, q.v. The presence of the Celtic form attests to the existence of the word in the 6th-5th mill. BC N. Pontic area. The Türkic-Eng. precise semantics and perfect phonetics attest to a common origin, and the Gaelic form attests to the origin in Neolithic times, prior to the Celtic migration ca 5th mill. BC.

English cure (v., n.) “fix, mend, heal” ~ Türkic kur-, qur- (v.) “fix, set in order”. Both Eng. and Türkic verbs have a wide range of meanings, the exact meaning cannot be determined without knowing a subject. The term “cure, curing (food processing)” ultimately also ascends to the Türkic verb kur-, but originates from its derivative kuri:- (v.) “to dry (i.e. fix by drying)”. It refers to a process of hardening or solidification by cooling or drying, which covers not only the traditional conservation of food by drying, salting, smoking, or heating, but also cures concrete and plastics. A medicinal usage is late, an extension of “take care”. Cognates: OFr. curer, OSp. guarir, Fr. guérir, It. guarir, Lat. curare, cura “care, concern, trouble”. Some semantic extensions are astounding: secure (Lat. se + cure), curable/incurable (Lat. curabilis), sinecure (Lat. sine + cure), procure (pro + cure), scour (Lat. ex + cure), manicure (mani + cure), curator, curate (cleric), cure (cleric), curia, and more. The severely confined distribution attests to a non-IE origin, a status of a loanword within the IE family. A spectrum of semantic extensions in Eng. attests to a path separate from that of the Romance languages, and separate from the A.-Sax. lexicon, where “cure” supplanted a native “heal” and Old Europe lacnian. The Romance path may be via Alan/Gothic (Visigoths) and Burgund, 1st mill AD. A late Scythian/Cimmerian path via Jute and Frisian could be a possibility. The IE etymology is erroneous in confusing the two independent homophones. While the first has numerous noun derivatives, the second has only one English noun, curing. A phonetic and semantic match leaves no room to doubt a Türkic origin. See cure (food).

English cure (food processing) (v.) “harden or solidify” ~ Türkic kuri:- (v.) “to dry (i.e. fix by drying, usually by smoking or heating)”. The Türkic word originated in food processing, ultimately ascending to the Türkic verb kur- (v.) “fix, set in order”, widely used in physical and metaphorical senses. Cognates: Balto-Sl. (Lith.) kuriu “to heat”, karštas “hot”, (OCS) kurjo “to smoke, cure”. Both Balto-Sl. forms practically exactly repeat the Türkic word and are identical semantically. The English term refers to a process of hardening or solidification by cooling or drying, which covers not only the traditional conservation of food by drying, salting, smoking, or heating, but also extends the meaning to cure concrete and plastics. The IE etymology is erroneous in confusing two independent homophones, cure (food processing) and cure “fix, mend, heal”. For example, kurgan is a derivative of kur- (same verb base, here meaning “organize, arrange”), the same with kurultai “arrange (family) ties”, they are unrelated to kuri:- “dried, hardened”. While the first has numerous noun derivatives, the second has only one exclusively English noun, curing, and a handful of peculiarly distributed cognates in Tr. and Balto-Sl. languages. Inability to understand homophonic nature of the words is a result of circular logic unable to recognize reality in conflict with dogmatic scenarios. See cure (fix).

English count (v., n.) (Sw N/A, F893, 0.01%) ~ Türkic köni (v., n.) “measure”. Cognates: IE cognates practically absent, save for questionable Lat. computare “to count, sum up, reckon together”, and its Romance cognates. It allegedly comes from com- “with” + putare “prune” > “to reckon”, a trick repeated over again where a prefix produces a substantive, semantically bordering on impossibility. Phonetic transition from computare to count is inconceivable given 13 other very normal ways Romans could say “count”, and the “count” was not one of them. The term evolved independently from a different root (vs. Lat. numer-, duc-, calc-, etc.). The other Eng. word tally for “count” ascends to the primordial Türkic stem tili/tele/dili meaning “tell”. The trio count, tally and quantity constitute an indelible case of paradigmatic transfer testifying to a common origin from the Türkic phylum. See quantity, tally, tell.

English cut (v.) “separate” (Sw N/A, F537, 0.02%) ~ Türkic kıd-/kı:d-, kıy-/qïj-, kes-/keš- (v.) “cut”. The form probably evolved with the notion “bite”, in different Sprachbunds expressed with verbs kus- and bit-, Cf. Sl. kus(at) (êóñàòü) “bite”, A.-Sax. bit(an) “bite”. Both forms are innate to Türkic, with related semantics. As a rule, the verb cut has complimentary deverbal nouns “cutting, piece, part, share, separated” phonetically identical with the verb. The allophonic forms were moving around, diverging and rejoining, taking various semantic shades, with their own handy areal means forming deverbal noun derivative notions “cutter”, “chunk”, etc., and their further progeny. Cognates: A.-Sax. ceor- “cut, separate”, ceorfaex “axe”, ceorfsaex “scalpel”, Sw. kuta “cut”; Akkadian kes-, kas- “cut, chop, break, shorten, abbreviate”, käs- “separate into small pieces”, the earliest record of 28-24 cc. BC; Akkadian borrowings into Arabic, all related to “cut”: kasap, kısım, kasım, kısmet, etc.; Heb. gedud “cut”, geduda “incision”; WMong. kidu- “cut”, Mong. khutga “cutter, knife”. Cognates also include innumerable derivatives of “cut”: Eng. cutter, cutlery, A.-Sax. ceorfaex “axe”, ceorfsaex “scalpel”, ONorse kuti “knife”, OFr. couteau “knife”, Rum. kutsit “knife”; Türkic kert (v.), Eng. carve (v.) “cut a mark”, OT kingirak (Gk. form akinak) “knife”, kezlik (dimin.) “small knife worn by women”; WMong. kituga “knife”, Dongxiang quduyo “knife”. Cognates cut across linguistic families to a degree that the word was held as a Nostratic ancestor. Distribution extends from the Atlantic to the Far East, the IE portion is but a small isle on the map, a PIE origin is delusory. The IE etymology of the verb “cut” deriving from the Fr. couteau “knife” is as absurd as to derive the A.-Sax. ceor- “cut” from the ceorfaex “axe”. Such absurdities are waiting to be rescinded. The notion of “cut” probably ascends to Neolithic economy, associated with first cutting tools, and had plenty of time to disperse its phonetics. In all languages the verb “cut” is a prime, all other grammatical forms are derivatives. From the western Türkic milieu, the form kut- “cut” is practically the only form that left a trace in Europe. The spectrum of vowels in various languages suggests that the original vowel was of front quality: ka-/ke-/kı- (kï-)/ku-, probably weakly or vaguely articulated. Hence the raster of forms found in base forms and derivatives: kad-, kaf-, ked-, ker-, kes-, kıd-, kıf-, kıy-, kut-, with some other viable variations that escaped records: we do not know the quality of the vowel that stands behind keor- (A.-Sax. spelled ceor- “cut, separate”). Variation in the second consonant points to dialectal alternation -s/-r typical for Oguz/Ogur vernaculars, Cf. Heb. chereb (xereb) “knife”. Sound change d <=> z is typical for numerous Türkic languages (Gmn. in Europe), -f may be a rendering of -ð/-þ, and -d is regularly changed to -y (Cf. badram/bayram/mayram “feast”). All these forms are acoustically allophonic. Phonetic variation is well pronounced in the derivatives: kadıš “strap” (lit. “a cut”), kıftu: “scissors” (lit. “cutters”), kaftan “coat, robe”, (lit. “scissored”), kedir-, kedriš- (“cut a strip”), kert- “cut a notch”, kıd-/ kıy- “cut, separate”. The form kut- is an allophone of the kad-/ked-/kıd-. The Eng. semantic raster mirrors that of the Türkic (e.g. cut me a deal, cut them off, cut corners, cut it out, etc., for withdraw, change, abandon, etc.). The absence of cognates in Celtic languages attests to the origin of the term independent of the post-4800 ybp (2800 BC) arrival of the Neolithic Celtic carriers of the R1b Hg to Iberia via Near East and N. Africa. European spread is probably connected with the overland waves to the Central Europe of the Kurgan nomads starting at 6300 ybp (4300 BC) with later near-total population replacement, and then continued accretion of the Kurgan nomads, the Cimmerians, Scythians, Sarmatians, and the Huns into the Central Europe from 2800 ybp (800 BC) to 1500 ybp (500 AD). The spread of the sources in time and to different ethnic groups predicates a wide variety of allophones and cognates, some with lost initial stem. See carve, curt, short.

English dash (v., n.) “move quickly” ~ Türkic daš- (dash-), (v.) taš- (tash-) “erupt, burst over the edge (like boiling milk)”, with connotation “sudden spill, burst spill”. The Az., Tkm., Tuv. form is daš- (dash-). A word dashboard “pane protecting from splashing (rain, etc.)” is a neologistic relict of a long-faded base meaning. Cognates: none outside Türkic languages. Homonymous with Eng. dash (v.) “break into pieces” of the same Türkic verbal stem daš-, see dash (v.) “break into pieces”. Various meanings with semantics of “sudden, eruptive move” are later innovations: race run “dash”, brilliant act “dash”, etc. The dash “showy appearance” is another meaning of the same stem in a noun form: daš (dash) (n.), see dash “appearance, look”; and accordingly the adjective: daš (dash) (adj.), see dash “outer, external; outer (of clothing)”. Perpetuation of particular polysemantic word with its particular semantic meanings is a positive attestation of and genetic heritage. The chances for borrowing or chance coincidence in case of paradigmatic transfer are nil. See dash (n.) “appearance, look”, dash (v.) “break into pieces, broken”.

English dash (v.) “break into pieces, broken” ~ Türkic taš-/daš- (tash-/dash-) (v.) “erupt (in flood), burst (through the (river banks)”. Cognates: Sw. daska, Da. daske “beat, strike”. Homonymous with Eng. dash (v.) “move quickly” and dash (n.) “appearance, look”, a case of paradigmatic transfer. See dash (move quickly), dash (n.) “appearance, look”.

Supplementary Note. Two identical words.

The chances that two words in one language would accidentally coincide in semantics and phonetics with two words in another completely independent language are slim to none. These chances can be assessed statistically. Allowing for simplicity 20 consonants and 5 vowels in each language (for Türkic, EDT lists 27 consonants, in reality about 10 more, plus 8 vowels), and some overlap like -t- vs. -d- that somewhat reduces that number, 10 consonants and 5 vowels in each language would be a conservative phonetic approximation. A 10,000-word vocabulary broken down into 1000 semantic fields (10 synonyms per field on the average) would be a quite conservative semantic approximation, since within a basic dictionary words blessed with synonyms are a small minority.

A completely independent language must belong to a separate linguistic family, and the subject word must be innate to that linguistic family.

A 3-phoneme word of 2 consonants and 1 vowel, like cat, would have 1/10 X 1/10 X 1/5 = 1/500 = 0.002 phonetic probability times 10,000 tries. About 20 words would statistically meet the phonetic criteria. Chances of these 20 words to meet the semantic coincidence is determined by the semantic probability.

The semantic probability would be 1/1000 = 0.001. For a word cat, that semantics would cover 10 statistical small animals: cats, squirrels, rats, and the like.

The probability of both phonetic and semantic match would be 20 X 0.001 = 0.02. It would take 50 of 10,000-word vocabularies from different languages to bring that low probability to a level of certainty, to find a single chance match in unrelated language for a single 3-phoneme word, say cat, also meaning cat or a similar small creature. All of the 6,000 world languages would have 120 statistical words meeting the desired criteria. In reality, a much smaller number of independent languages would produce a much smaller number of the random coincidences.

The probability of encountering the same word with certain different semantics, say cat meaning “cat” and “gossip”, would be, 0.02 X 0.02 = 0.0004. In a case of such duplet, it would take 2500 independent lexicons of 10,000-word dictionaries, way more of what our small world can supply, for that to have a meaningful chance to happen. Such a coincidence would be a sound proof of a paradigmatic transfer case between languages.

Thus, excessively conservative estimate for the word dash with only two meanings, “move quickly” and “break into pieces”, to have a simultaneous chance match in some other unrelated language, would statistically take an order of magnitude of some 2500 unrelated languages. Ditto with the two meanings “move quickly” and “outer, external”. For three concurrent meanings, “move quickly”, “break into pieces”, and “outer, external”, would be needed an astronomical 6,250,000 languages.

Then, the chances that the English and Türkic words dash are genetically connected are much-much better than 6,250,000 to 1. But in the 800-word study list, dash is not the only such case. A second case brings the chance probability to 6,250,000 squared, or 1 in 39,062,500,000,000. That is languages, not the words. An only objection to the magnitude of this fact may be that a doubter is not keen on statistics. That, no doubt, would be very true.

English deem (v.) “think, recon” ~ Türkic demek (v.) “think, mean, believe, suppose”. Ultimately a semantic derivative of the base te:-/de:/-diy-/tejän “tell, say”, the same base that developed Eng. tell “tell”, relaying a kernel notion of “speech” and “thought”. The deverbal verbal form shows in the verbal suffix -mek “make” for the root te:-/de:, lit. “tell-make/act”. Unlike the tell “tell”, demek “deem” is not used in direct speech, but in conjunction with an impersonal form of the verb, e.g. “I deem that ~ in my opinion... (John came, it will be raining, etc.)”. For transposition of Eng. say and tell see say. Semantic spread extends to two dozen (!) immediate derivatives and numerous grammatical functions. The verb is held as one of the oldest Türkic words. Apparently, the form similar to demek (inf.) was internalized on the Gmc. and Sl. scene as a root dem- with morphological adoption for local use. The Gmc. extended usage focused on the notion “mean”, as in “I mean that”, the Sl. usage centered on the notion “think, thought”. Cognates: A.-Sax. deman “tell, deem, think, consider, judge”, dem, demend, demere “judge, ruler”, OFris. dema “to judge”, OSax. domian, Icl. telja “deem”, MDu. doemen, ONorse dma, OHG tuomen, Goth. domjan, doms “to deem, opinion”; Ir. deir, Gael. duil “deem”; Balto-Sl. (Latv.) duoma, duomat “thought, think, believe”, Sl. duma, dumat (äóìà, äóìàòü) “thought, think”, Rus. de-, deskat (äå-, äåñêàòü) “as though, as deem(ed)”, Bulg./Bolg. demek “think”; Gk. θῡμός “thought, spirit”, Skt. dhumas “thought”; Gujarati dhāravuṁ (ધારવું) “suppose”; Mong. duu “think”. Distribution extends across a width of the Eurasia, it covers only a smattering of the IE languages, and reliably excludes both “PIE” and “PG” origin. No suitable IE parallels, the IE etymological divination on PIE “set”, “put”, “doom” are totally inapplicable, albeit the mock “PIE root *dhe-” unwittingly mimics the attested Türkic te:-/de:. Geographic distribution and phonetic dispersion attest to a long history ascending to the 2nd mill. BC (Aryan migration), Corded Ware period (3rd mill. BC), and way beyond. The entire trio of the English “say/tell/deem” speech-related lexicon constitutes an indelible case of paradigmatic transfer inextricably attesting to the common origin from the Y-DNA R1a/b Türkic Kurganians' demographic milieu. See tell, say.

English deliver (v.) “give birth”, delivery (n.) “parturition, giving birth” ~ Türkic döle-/töle- (v.) “bore, deliver”, döl/töl (n.) “newborn”, Turk. “foetus, unborn child”, Tkm. “semen, lambing”, (v.) “give birth, lambing”. Near universally, along with more meanings, Türkic semantics is bifurcated into notions “parturition” and “bring, deliver”. The first addresses a baby, the second - a mother. A like bifurcation is carried over by the LLat. and OFr. versions. The precise semantics and phonetics, and enduring trait of bifurcation exclude any chance of a random coincidence between the LLat./OFr. and Türkic Kurgan words. Eng. has only 3 synonymous verbs in the “birthing” field, bear, birth, and deliver, 3/10,000 in a basic 10,000 dictionary. Türkic has 12 native words just for “birth”, some probably contributed millenniums ago by other languages. In pastoral economy, mating, insemination, and birth of a kid are routine business, hence the rich reproduction vocabulary covering animals and people. For “parturition”, the A.-Sax. records give cenning (with k-), a derivative of gen-, ken- “parturition, birth, procreation, generation”, see gene. That is connected with the Türkic word kin for female genitalia with extension to “new child”, and with bearn “born (child)” fr. Türkic verb ber- (see bear (v)): bearncennicge “mother”, bearngestreon “procreation of children”. Cognates: OFr. delivrer “childbirth”, LLat. deliberare “free”. No other IE cognates, attesting that these cognates are loanwords into the IE family. One language carries one prong meaning, and the other another prong, attesting that each had both prongs. Both cognates are late arrivals, datable ca 1000AD ±few centuries. The “IE etymology” of the European cognates confuses the issue with an unsustainable compound of de- “away” + root lib-/liv- “rid, free” (Cf. liberty), a trick repeated over again with a prefix treated as a substantive. It confuses the root döl- with a prefix, semantically mishandles parturition, turns a late derivative deliver into an independent prime root, and adds a new word to the existing native vocabularies. The prime root döle- carried the notion “deliver” millenniums before it reached the LLat./OFr. and finally Eng. (ca 11th c.). The late arrival points to Burgunds as a source for both OFr. and LLat. borrowings. Cognate distribution is confined to few European languages, and consistency in form points to a single-source origin. The fossilized morphology (iver, i + ver, iv + er) points to the origin from that source. Plausibly, -iver is a dialectal Romanization of the Türkic suffix -ur that forms absolute or indefinite tense participle and/or is a predicate marker (dölür). A Türkic “birthing” quintet is a trade terminology passed from generation of “birthing” practicians to their “birthing” daughters and granddaughters with other “secrets of trade”. It can serve as unequaled linguistic marker inherited from generation to generation immune to all societal turmoils. The paradigmatic presence in English of the “birthing” quintet “delivery” (döl-), “doll” (döl-), “gene” (ken-), “bear” (v) (ber-), “kin” (kin, kün/kun) indelibly attests to its origin from the Türkic milieu independently of the paths that brought them to English. See bear, doll, gene, kin.

English delve (v.) “cut into” ~ Türkic del-/tel-, deš-/teš- (v.) “pierce, punch, break through”. The -l- and -š- forms are about equally spread, frequently in the same languages, and even formed a paired jingle a la delik-dešik “Swiss cheese”. Its many reflexes in IE languages were dutifully noted and cheerfully explained away. Cognates: A.-Sax. delfan “to dig”, OSax delban, MHG telben, “to dig”; Balt. (Lith.) delba “crowbar”, Sl. (Pol.) dłubać, (Czech) dlabati, (Russ.) dolbit (äîëáèòü) “crack, peck, pound”; Tr. telik/tešük “hole, aperture, puncture”. All European cognates are Gmc./Balt./Sl., solely N. European, that should have been a salient message to the IE philologists. The Türkic cognates extend across Eurasia. No IE cognates, the “PIE reconstructions” are unattested quasi-scientific fantasies. The word belongs to the host of sibling lexemes shared exclusively by the Gmc., Balto-Sl., and Türkic families. Notably, the sibligs tel- and teš- are noted in the same body of literature, attesting that the archaic sound change l ~ š lingered in the Oguz languages (Cf. EDT, OTD, EDTL). The form -l- was attested in the Ogur Karluk language of the Huns, and the form -š- is present, for example, in the Chuv. The suffix -fan/-ban/-ben and its truncated (Sarmatian) forms -ba/-bi/-v, specific to the Gmc./Balt./Sl. groups, is a palatalized reflex of the Türkic suffix -pan that forms participles: digging, piercing. The consistent use of the combination of Türkic root and suffix is a case of paradigmatic transfer, vividly attesting to the common origin from the Türkic scene.

English descend (v.) “move lower, plunge, fall, go down” ~ Türkic düš- (v.) “lower, fall, pour”. With the suffix -an/-en the verbal stem düš- gains numerous applications: voice, participle, reflexive, noun, plus grammatical forms with nasal suffix -aŋ/-eŋ rendered -an/-en in Gmc. languages. The Türkic original form düšen covered most of the practical usages. The Türkic düš- served as a base for the Lat. prefix de- “diminish” which propagated to the local languages within the sphere of Lat. cultural influence (Cf. Lat. pre- fr. Tr. bir “one” etc.). Cognates: OFr. descendre (10th c.) “go down, fall into, originate in, dismount”, douche (v., n.) “spray”, Lat. descendere “come down, sink”; Rum. dušis “stumbling”. The word is absent in the bulk of the IE languages but has widespread distribution in the Eurasian steppe belt from Atlantic to Pacific. The IE etymology interprets the Lat. form as de- + scandere “climb”. That would inaugurate it as a proto-form for the Türkic family, an obvious guff since the word düš- is an unmistakable loanword into the IE family. The Lat.-based etymology unwittingly leads to another Türkic stem, sön- (v.) “die down, disappear”. Two Türkic sibling words, sön- and siŋ- (sin-) “sink” have overlapping meanings and appear to be original allophones that gained individual semantic overtones. The sön- does not mandate disappearance from the view, and the siŋ- implies that it is stronger on disappearance, that's how in our houses we have sinks and not songs. Parsing descend as de- + scandere unwittingly leads to another series of cognates: A.-Sax. senc(an) “sink, plunge”, saeg(an) “sink, settle”; OFr. (des)cen(sion) “descent”; Lat. (des)cen(dere) “descend”; Sl. sig(at), sgin(ut) (ñèãàòü, ñãèíóòü) “plunge, disappear”. It is inconceivable that Anglo-Saxons and Slavs would take a Lat. loanword, strip the prefix, and spread a stripped stem among their speakers. The proffered Lat. origin with the compound of de- “down” + scandere “climb” is also a nonsensical idea of using “climb” for “descend”. The “climb” is a derivative of “spring, leap, ascent” with cognates like “hasten, leap, jump”, a totally uncouth logic for notions of falling and going down. The idea of a Lat. origin must be tossed out. The evidence suggests that Eng, Lat., and Sl. are using a Türkic word, obtained independently of each other via different paths. In all cases phonetics is close, and retained semantics is pinpointed and exact. The OFr. form is an apparent candidate for acquisition fr. the Burgund-Provence language. The Eng. word demonstrates a separate path since it is not shared by other Gmc. sister languages. The Lat. form, if it is Classical Lat. and not LLat., also demonstrates a separate path since it could not originate neither in Fr. nor Eng. The notion of descendant “originate in” has already been recorded in the OFr. form, positively excluding an Eng. innovation. The A.-Sax. form eventually conflated with its OFr. allophone. See toss.

English dip (v.), deep (n., adj.) ~ tüb, tüp, düb, dip (n.) “bottom”. Ultimately fr. a notion “root” (n.), hence homophonic cognates denoting roots and tubular structures, and foundation, substrate: tuber, tube, tablet, table, (bath)tub, type (impression), type (precursor), typing, etc. There is a slew of Türkic derivatives that includes a verb “submerge”, “go under water”. Eng. has internalized the word practically without changes. Cognates: A.-Sax. diepan “immerse, dip”, deop “deep”, OSax. diop, OFris. diap, ONorse djupr, OHG tiufi, Goth. diups, Du. diep, Dan. dyb, Gmn. tief, teufe “deep”; Ir. domhain (b/m alternation), Welsh (Cimr.) dwfn “deep”; Balto-Sl. (Lith.) dugnas (< dubnas) “bottom”, dubus “deep”, dubti “immerse, dip”, dauba “ravine”, ((Latv.) dubens, dibens, dibins “bottom”, dubt “immerse, dip”, dauba “ravine”, (OPruss.) padaubis “valley”, (OCS) duno (< dbno) “bottom, foundation”; Brahmi tüp “bottom”; Mong. (Kalm.) töb “foundation”; Akkadian dibdibbu “water clock, clepsydra”, reduplicated, earliest record of 28-24 cc. BC. The word has cognates in all Gmc. and Balto-Sl. languages. No palatable IE etymology. A “PG etymology” suggests a faux “proto-word” *deupaz, a derivative of a faux Pan-PIE *dʰewb-, unwittingly a slightly distorted clone of the attested Türkic tüb, düb, dip with a Hawaiian-type articulation. The “proto-word” must be a neologism (Leiden, 2013): the Proto-Judaic Akkadian word could not have survived as a “PG proto-word” during the scientific period of the Aryan-race cleansing. Since the notion “deep” is an extension of the notion tube (< “root”), to the host of cognates belong all worldly cognates of the tü:b “tube”. The massive assemblage in English of the related Türkic derivatives of the prime notion “root” magnificently attests to their common origin from the Türkic linguistic milieu. See table, tablet, tuber.

English divide (v.) “split, separate, fraction” divvy “apportion, deal, share” ~ Türkic til-/dil- “cut to pieces”. Cognates: A.-Sax. daelan, Goth. dailjan, OHG teilen, Icl. deila, Norse divide, Du. delen, Yiddish Tyyln; Ir. deighilt; Lat. divide, Rum. diviza; Gk. diaírei (διαίρει); Balto-Sl. (Latv.) dalīt, (Lith.) dalyti, (Sl.) delit (äåëèòü), (Pol.) dzielić, (Lusatian) źělis (jelis), all “divide”, (Serb.) dilum “part”. All forms are allophones, with the same semantics. The l > v transition is illustrated by juxtaposition of Gmc. vs. Romance forms, the initial d-/t- has a form dz-, j- in Pol. and Lusatian, Gk. transition is l- > r-. All allophones ascend to the Türkic stem til-/dil-, apparently from the Sarmatian tribes, except for Gk. and Ir. that point to separate independent paths, with Lat. possibly an allophone of Ir. with gh > v transition. English, in addition to divide and divvy, has derivatives deal, division, till (booty for division). The antiquity of the substrate and cultural borrowing is attested by numerous and various derivatives developed in the individual languages. Notably, Türkic has other words for “divide”, bӧl- (e.g. parse) and kes-, which also found their way into the languages of diverse linguistic groups from the Europe to the Far East. Paradigmatic transfer of a series of lexemes, including a homophone til- “tell” attests to a common origin from the Türkic phylum. See deal, division, tell, till.

English do (v.) “make, act, perform, cause, put, place” (Sw N/A, F24, 1.89%) ~ Türkic tur-/dur-, Chuv., Uig. tu- (v.), a truncated form, “do, act”. The CT polysemantic verb tur-/dur- conveys among other derivative notions “intent or readiness to act, duration or permanence of actions”. It is a grammatical function verb and auxiliary verb of action, functionally eidetic to the Eng. pair to do (infinitive case). “Among Turkic verbs with grammatized functions, tur-/dur- occupies a special place in terms of universality, richness and diversity of its grammatical application” (Sevortyan EDTL, 1980, vol. 3 p. 299). Eng. did not carry over the wealth of concrete applications, while closely reflecting the universal quality of the lexeme. Cognates: A.-Sax. don, dide, didon, dyde, dydon “to do”, doere “doer”, OSax. duan, OFris. dua, Du. doen, OHG tuon, Gmn. tun; Türkic cognates incl. form tu. There are no actual IE cognates, a dreamed up “IE root” *dhe- “set, put, place” is a flight of fantasy derived solely from the Gmc. roots and accordingly mimicking the Türkic original. The suggested “IE root” semantic trails another aspect of the Türkic tur-/dur- “stand (physically or to stop)”. Distribution is typical and peculiar: the width of the Eurasian Steppe Belt plus a smattering in the North-East of the European peninsula. The Türkic stem tö-/tü- serves in numerous derivatives expressing semantics of “make, made”: törü- “happen to occur, emerge, be born, appear, give birth”, törüt “create”, törči “happen, occur, undertake, initiate”, törči also serves as auxiliary verb exactly like English do, with a similar complement of functions: “make, engage, carry out, carry on, get done, proceed, cause to happen, engage in, comport, execute, finish, complete action” with idioms and nuances, tükät “completeness, completion of action”. Like the compound of the type hairdo, doable, do-gooder, English has a compound kindred “related” that appear to be a transposition of the Türkic compound törkün “my clan, my tribe, house of blood relatives” = tör + kün ~ “do + kin” > kin + dred > kindred. In that case in the compound the English and Türkic components are identical, tör ~ do and kün (like in the ethnonym Hun) ~ kin. Even wider range of Türkic derivatives is developed with the agglutinated suffixes. The series of do, make, act, and other action lexemes form a case of paradigmatic transfer, inescapably attesting to a common origin from the Türkic phylum. See act, make, kin, to (prep.).

English don (v.) “put clothes on” ~ Türkic don-, ton- (v., n.) “put clothes on” (v.), “clothes” (n.). Nearly forgotten, the word is pretty much active in idiomatic expressions: “it donned on me”, “why don't you don the evening dress”, etc. Cognates: Anglo-Sax (OE) (onscry)dan “to clothe”, (scry)dan “vestry”; taken from the church lingo, scrydan is “sacral vestments”, tunece, tonice “garment”; Ir. guna “clothes”, Welsh (ao)dach “clothes”; Ru. oden (îäåíü) “don” (imperative); Bolg. donove “attire”; Lat. tunica “tunic”, toga “drape”; Gk. chitîna (χιτωνα) “tunic”; Pers. pufidan “wear, clothe”; Khotanese thauna- “clothe”; Kannada (u)ḍuge (ಉಡುಗೆ) “don”; Brahmi tom; Mong. daashinz (äààøèíç) “clothes”, tonug “horse harness”; Kor. deuleseu (드레스, /tirisi/) “clothes”, Jap doresu (/donese/) “clothes”; Tat. tyŋ- (v., n) ditto. The phonetic and semantic concurrence is perfect. Distribution across Eurasia and linguistic families, and a positive oddball within the IE family, attest a loanword status in the IE languages. No IE etymology. The offered folk-type IE etymology, a neologism “contraction of do on” is spurious and laughable. It implies that an honor of being a proto-word belongs not even to Brahmi but to Eng. It is unconscionable that an abundance of the cognates on the European peninsula and across Eurasia went unnoticed. The etymology for the Lat. words and for the Skt. sthag- from an unattested “PIE root” *(s)teg- “cover, roof, roofing material” semantically echoes the notion “cover, clothes”, thus likely a 3rd c. BC allophone of the form don/ton. The first consonant is fairly consistent across Eurasia, the second consonant is more fluid, attesting that the original -n- might have had a peculiar articulation, Cf. ñ, ñ, ŋ. The Tat. form corroborates that speculation. The t- (OTD)/d- (Eng.) shift is a strictly dialectal variation, both forms coexisted from early times. G. Clauson (EDT p. 512 on) pinpoints the form don- (vs. ton-) specifically to Azeri, Turkmen, and Ottoman Turkish, i.e. roughly the Aral-Caspian interfluvial. That is consistent with other pointers to the Aral origin of the Gmc. component. Curiously, like the Türkic generic agach “tree” became the Gk. acacia for a specific type of trees, so the Türkic generic ton/toŋ “dress, clothing” became Lat. for a specific types of dress. The word falls into a pattern of paradigmatic transfer as a member of a baker's half-dozen dress terms, directly attesting to their Türkic origin. See belt, cowl, robe, sari, sash.

English drag (v.) “pull”, (n.) “hindrance, impediment” ~ Türkic tar-, tart-/dart-, tarït- (v.) “pull, drag, draw”. The form tarït- and its contracted form tart-/dart- are intensive aspect of the base tar-. The verb is neutral to transitivity, attesting to its origin prior to the rise of the grammatical transitivity. The verb is very polysemantic, numbering 10 main semantic fields, plus 15 semantic offshoots, plus a scattering of 17+ single senses in separate sources, plus uncounted subsenses. A listing of distinct semantic meanings numbers 128 entries, likely a Türkic and possibly among the world's semantic champions. The mind-boggling semantic spread attests to its amalgamative origin from once independent languages. The base meaning of the root tar- is “pull, drag, draw”, shared by most of the Türkic languages, it alone has 30+ distinct semantic flavors. The other main senses of the main semantic fields are “drag on/trail”, “continue/last”, “jerk/yank”, “weigh”, “ponder”, etc. Numerous words form idioms. In the processes of an immediate inheritance, and direct and secondary loanword acquisition, such prodigious polysemantic word had to leave a stupendous trail within the daughter languages and a series of culturally impacted languages. The Gmc. languages were immediate beneficiaries of the inheritance, Cf. Eng. drag, draw, dredge, dregs, drive, tardy, trace, track, tract, tractile, traction, tractor, tradition, traffic, trail, train, tram, trammel, trap, travel, traverse, trawl, tray, tread, truck, retard, and more. The whole shmear probably started with a pair of logs dragged by draft animals with rollers and wheels added millenniums later. The original Stone Age technology lasts well into the 21st c. in snowy and swampy areas where wheeled transport does not work, now mostly in sledging and logging. Time has expanded semantic range with an explosion of derivative innovations related to pulling, like the northern runners and the southern rollers and then the wheels. They ascend to the Stone Age technologies. The published cognates are but a small and selective sampling. An attempt to trace all cognates of the 128 meanings would be a gargantuan task. Cognates: A.-Sax. dragan, draege, drogon, droh, drohnian, drohtnian, trog, troh, ðroh, trugon “drag, draw, protract”, daerst, daerste “dregs, refuse”, droht “condition of life”, drohtað “mode of living, conduct, environment, society, condition, employment, vocation”, drohtian, drohtnian “behave, associate, live, continue”, trod, trodu “track, trace”, OSax. dragan “carry”, OFris. drega, draga, ONorse draga “draw, drag, pull”, dorg- “fishing tackle”, OIcl. draga “drag, pull”, dorg “fishing tackle”, Goth. dragan “carry, drag”, draibjan, dreiban “drive”, OHG tragan “carry, bring, lead”, Gmn. tragen “carry, bear”, darge “fishing tackle”, MDu. draghen “carry, bring, throw”, dorg- “fishing tackle”, Sw. drag “to tense, draw”, drag “long narrow depression in soil (trace, trail)”, dörj, dorj “fishing tackle”; OIr. tarla “pull”; Lat. trahere “draw”; Gk. travixte (τραβηξτε) “pull”; Rus. doroga (äîðîãà) “trail, road, way”, doroga, dorojka (äîðîãà, äîðîæêà) “fishing tackle”, droga (äðîãà) “beam (between wagon axles”, drogi (äðîãè), drovni (äðîâíè), aka rozvalni (ðîçâàëüíè), volokusha (âîëîêóøà), volochuga (âîëî÷óãà) “sledge, stone-boat”, drojka, drojki (äðîæêà, äðîæêè) “light carriage”, tarabit, tarabanit (òàðàáèòü, òàðàáàíèòü) “drag, pull”, tarabarka (òàðàáàðêà) “traverse, crossbeam” (Türk. compound with ber- “bear, carry” and diminutive suffix -k), tarak (òàðàê) “pole, rod (for drying nets)” (Tat. tartyq, with instr. suffix -yq/-ïq), tarantas (òàðàíòàñ) (Tat. taryntas) “long-base wagon”; Pol. dorozka “light carriage”, darka “fishing tackle”, taratatka, taradajka, taradejka “wagon”, òàðàíêàò “trail, lag, linger”, tarambuchit “cling, annoy” (Fr. tarabuster “cling, annoy”); Skt. dhrajati “pull, slide in”; Fin. tarttu- “seize, catch”, tork(k)o “fishing tackle”, Mordvin targa- “pull”; ODravidian tor, dor, tallu “push”; Nenets tartaɣaš “pull”; Kalmyk, Mong., Manchu tatxa “drag, pull, pull over” (tata- < tarta-); Kor. talda, tada 달다 “weigh, weighing” (< tara-, < tan-); Chuv. turt- “pull”; Alb. tartis “weigh, weighing”. The first consonant tends to wobble between d- and t-, like in Türkic languages, the first vowel wobbles between -a-/-o-/-ö- and zero. Numerous words carry traces of the Türkic suffixes with -g/-ɣ/-k/-q for forming conjugations, declensions, and derivatives. Any claims to an “IE origin” and “PIE proto-words” are doomed by the spread of the word across linguistic families and continent-wide vast geography, impossible without mobile Türkic nomadic pastoralists spreading the word across Eurasia from Atlantic to Pacific. Forsaking its creed concept of the “proto-wording”, the IE etymology invented a slew of different “proto-words” pleading a special case for each special instance descended from a primeval notion of dragging a sledge or a fish out of water. The ethereal cornucopia includes the “PIE roots” *dhregh- “draw, drag on the ground”, *der-, *dreb- “run”, *do- “give”, *drou-, *deru- “firm, solid”, *pag-, *pakslo- “fasten”, *drogon “wheel”; the “PG roots” *draganan “draw, pull”, *tred- “tread, step”, *trep- “snare, trap”, *traujam “flat wooden board with a low rim”, “OE roots” *dreaht, *dræht, and the “VLat. roots” *tragulare “to drag”, *tragere *traginare “to pull”,*tardivus “slow, sluggish”, *transfricare “rub across”, *tripaliare “to torture”, *tripalium “instrument of torture”, and many more. Notably, the VLat. inventions are demonstrably anachronic, since they refer to the post-Latin words of the New Era and can add nothing to etymological tracings. Instead of ascending to a base root connected with various aspects of the notion “pull, drag, draw”, this industry manufactures individual “proto-words” rammed to fit into a narrow linguistic window. The distribution of the word in the Northeastern Europe points to the amalgamation within the Corded Ware (3rd mill. BC) archeological culture. The Celtic form is significant in that the Celtic Kurgans arrived to Europe in ca 2800 BC, 800 years before the time of the Aryan southeastern migration. The consonance of the Far Eastern forms with the Türkic tar- corroborates that the Zhou-type “Scythians” at the turn of the 2nd millennium BC reached the Pacific belt and amalgamated with the local populations. Without demography of the physical carriers conveying cultural influence, there is no place for the faux “PIE” root *dhregh- to penetrate innumerous far-flung alien languages. There is no other alternative for the Stone Age-technology society to spread the word across Eurasia from Atlantic to Pacific except for the mobile Türkic nomadic pastoralists. Of the recorded Türkic 128 meanings based on the root tar-, quite a few words from the other 9 main semantic fields had to be disseminated across the languages of the Eurasia and leave their traces in the local languages, like the notions of the weighing and measurement cited for the Alb. and Kor. languages. A real etymology has wide open fascinating perspectives.

English earn (v.) “deserve by efforts or actions” ~ Türkic ar-, (er-, jer-) (v.) “tire, weary”, “get tired, get weary”, i.e. “after hard labors”. A reflexive form arïn with Ogur prosthetic initial consonant produced a synonymous form garner. In the antiquity and middle ages, the literary examples of the notion “earn” regularly deal with politics and military affairs, never refering to any harvests or autumns. Cognates: A.-Sax. earnian “deserve, earn, get a reward for labor”, OFris. esna “reward, pay”; Sum. ir, Hu. erni “to be worth, deserving”. By phonetic resemblance, the verb with the semantics “garnered” was etymologically confused with the noun “harvest” and its verbal version “to harvest” conflated with “autumn”: A.-Sax. ern “harvest”, ONorse önn “work in the field”, OHG arnon “to reap”, aren “harvest, crop”, Gmn. Ernte “harvest”, Goth. asans “harvest, summer”, OCS jeseni, Russ. osen, OPruss. assanis “autumn”, all unrelated to the notion “deserve, garner, gain through efforts”. The IE etymology for garner is equally naively fictitious, it connects it with a separate unattested “PIE” fictitious root for “grain”. The Sum. ir and Hu. erni, like the Türkic “having labored hard”, provide a direct semantic correspondence and phonetic match, and also take the word earn from the confines of a peasant labor to the larger world of contracts, mercenaries, and obligations that reflect the substance of the word: “earned salary”, “earned living”, “earned trust”, “profited from laborious activity”. The forms earn and garner attest to two separate demographic paths, one Oguz and the other Ogur, which brought to English two forms of the same word. They grew to two distinct shades of the same notion.

English eat (v.) “to eat, devour, consume” (Sw N/A, F545, 0.03%) ~ Türkic ye-, ašà- (ashà-) “eat”, with suggested variation dye-. Ultimately, both forms probably ascend to a single-vowel word, a what is now ı: “vegetation”, as a generic appellation for any plant food. The synonymous ye- and ašà- have nuanced semantics, ye- is a mundane act of eating, ašà- is a dignified process of eating, Cf. Sl. otkushat, potchevat (îòêóøàòü, ïîò÷åâàòü) “dine” vs. kushat (êóøàòü) “eat”. The ašà- has a tint of nobility; thus the Eng. eat and Gmn. essen constitute a bifurcated paradigmatic transfer of the same notion, semantically akin to the English usage of thou vs. sg. you. The second member of the Türkic idiom aša- ič- “eat and drink” has been preserved in the A.-Sax. ðicgan and ge-ðicgan (with -ch-) “drink” (and “eat”), another case of paradigmatic transfer. Türkic has the form yedi/ye:di: with conjugational suffix -di, that suffix is widely used as an indefinite form marker by the IE languages with the roots in the Eastern Europe of the 3rd mill. BC, Cf. Gmc. it/et, Gk. edo (ἔδω), Lat. edi, Skt. atti, “eat”, Sl. yesti (jåñòè), eda “food”, see edacity. The immense variety of forms and the Eurasian spread attest to a supremely long history, Cf. modern Türkic forms ye-, ij-, či-, i-, e-, ije-,'im-, em-, em-, če-, cie-, či-. Cognates: A.-Sax. et (v.) “eat”, OFris. ita, OSw. etan, MDu. eten, Du. eten, OHG ezzan, Gmn. essen, ONorse eta, Goth. itan. Baltic forms are êst, īst, ėmi, ę̄du; Slavic forms isti/ests/jåñòè/jėsti/jisti/jesc; Skt. atti “eat”, ye:k “demon” lit. Tr. “devourer”; Arm. utem (Türkic 1st pers. sing), Gk. edo (εδω), esthio (εσθίω), estho (εσθω); Lat. edi; Chinese 吃 (chi); Mong. ide- (syn. asara, another Turkism, Cf. Gmn. essen). The Türkic prosthetic consonant ch-/j- in ye, i.e. či, če, cie, či, Slavic jåñòè/jėsti/jisti/jesc “eat”, eda “food”, and Chinese form 去 ((shi), with Türkic transposed prosthetic -j-/-y- in ij, ije points to Ogur form i/e > ye/chi/che, vs. Gmc., Balt., Slavic, Skt., and Arm. unadulterated i/e forms; the Türkic auslaut suffix -ta/-tan/-ten/-zen/-sen is agglutinative marker related to grammatical person and tense. The spread of the forms it- (at-, et-, ut-)/id- (ad-, ed-, ud-) may alternatively point to the original form it-/id-. Then the modern Türkic form ye- is a contraction; that links the Mong. form with Eng./Gmc. forms via some intermediary like Scythians (Cf. Gk./Lat./Sl. form), Khakass or the Eastern Huns. The Gmn. form essen (v.) may alternatively be a desiderative form ye:se:- of ye- “want to eat, hungry”, or arise to the Türkic noun aš (ash) “food”, verb aša:- (asha-) “eat”. The Skt. ye:k is peculiar and diagnostic, it is a component of a duo of Skt. Buddhist terms of Türkic origin, ye:k “demon” lit. “devourer”, and yakša “demon” lit. “gluttonous, desirous to englut”; they complement the Skt. Buddhist bhüta “ghost”, in Türkic ičgek (ičkak) “drinker, vampire”. The significance of the duo is that these Skt. Buddhist terms use Türkic terminology carrying transparent Türkic etymology, complete with the desiderative suffix -ša, with minor phonetic adaptations, and semantic field expanded from concrete nouns to generic concept. These Skt. Buddhist terms set an upper dating limit of 6th c. BC, they provide attestation to the form of the Türkic lexemes at the time of their adaptation. The terms may be much older than Buddhism, which likely used an inherited Skt. lexicon to express a new concept. The Chinese word is likely a reflex of the Scythian Zhou component in the Chinese language. The spread of the word is impressive, with terminal points in Western, Northern, and Mediterranean Europe in the west, India in the south, China and Manchuria in the east, and the Eurasian Steppe belt in the north. The distribution of the forms across families points to an unequaled reach across Eurasia. The multi-faceted aspects of paradigmatic transfers impress with their attestations of traceable and vivid genetic connection. See chew, edacity, vampire.

English eke (eke out) (v.) “live from day to day, with difficulties, hardship, precarious” ~ Türkic ek- (v.) metaphorical “eke out”, “with efforts and strain”. Ultimately fr. a polysemantic verb ek- “till (farming work)”, Cf. iki “two, secondary”. Ekin is a noun with the deverbal notions of insufficiency and sourness, acridity; the suffix is retained in the Gmc. languages. Türkic derivatives include verbs eklä- (erkla-) “apply efforts, strain”, eksï- (eksü-) “diminish, reduce, lack”, eksük “shortage, loss, inadequate, defective”, eken “indefinity, being, while”, and erki: “perhaps, uncertain, doubt”, etc. They carry a specific notion of precarious transitivity. Semantics includes “supplementary, supplemental, additional, extra, further, addendum, extension”, etc., Cf. the A.-Sax. eaca “increase”. Parallel derivatives carry a notion of sourness, applied in metaphorical and physical context, that conflates sourness and hardship: ekši:- (v.) “sour, acid, tart”, ekšig (adj.) ditto. The ek- words echo the stem erk- denoting “force, power”, reflected in the unit of energy/work erg and in the Gk. ergon (εργων) “work/task”. Other than some conflation and phonetic homophony, the stem erk- is not related to the stem ek- and its Eng. sibling eke. Cognates: A.-Sax. echen, ecan, eacan, eacian, ONorse auka, OFris. aka, OHG ouhhon, Goth. aukan, wahsjan, Gmn. wachsen, all expressing an abstract notion of “augment, get along, handle hardship, overcome”; Kor. ekta- (v.) “harrow”. Direct European cognates are confined to Gmc. languages. Distribution attests to a non-IE origin of the word, the negative semantics is not present in the balance of the IE languages. It is fairly significant that Korean farmers used a Türkic farming word. A suggested “PIE proto-word” *aug- “grow, increase”, rated as a cognate of ek-, is likely an allophonic form of it, though it is lacking a notion of deficiency present in Eng. The affinity is supported by the farming semantics of the form *aug-. The semantic and phonetic analogy would add Lith. augu, augti, aukštas, Lat. augmentum, augere, Gk. auxo (αυξω), auxein (αυξειν), Skt. ojas-, vaksayati, all expressing “grow, increase, supplement” to the above cognate line-up. The uniqueness of the semantics, perfect phonetics, a farming connotation, and peculiar distribution makes the Türkic origin certain. Given the uniqueness of the word, a chance coincidence is a blunt impossibility.

English fare (v., n.) “getting along, status, state of affairs” ~ Türkic ber- (v.) “bear, carry, give, convey”. The Türkic ber- (v.) is extremely polysemantic: “give, bring, send” are most popular of the 13 semantic clusters and 13 grammatical functions, see bear (v.). The b/f alternation is regular, f is typical for the East European or West Eurasian palatalized pronunciation independent of the linguistic family association. Cognizant with the verb fare are expressions like “How you are faring?” i.e. “How you are doing?” and affairs “matters of personal concern”, “needs”. The English fare “state of affairs” is unrelated to travel, it describes status: welfare, warfare, workfare, lawfare (legalistic war), vs. the travel's wayfare, wayfarer, farewell, eelfare (runs of eels), thoroughfare, port, part, depart, etc., from a similar stem bar- “part, disappear, go, walk”, see port. Cognates abound. Cognates: A.-Sax., OHG beran, Goth. bairan “carry”, ONorse bera “barrow”; OIr. beru/berim “produce”; Balto-Sl. (Latv.) parvest, (Lith.) pervežti “carry” (OCS) birati “take” (opposite of “give”), (Ru.) bremya “burden”, Lat. fero, portare; Gk. pherein; Arm. berem; Skt. bharati, Av. baraiti “carries”, OPers. bara- “carry”. Gmc. forms come with Türkic suffix -an for active verbs. Balto-Sl., Skt. and Av. forms are cited with a Türkic 3rd pers. sg. suffix -ti/-di. Lat. form is cited with the Türkic imperative suffix -ar/-er. OIr. and Arm. forms are cited with a Türkic suffix -im indicating 1st person “(I) carry”. The English verb fare apparently conflated with the notions for travel and movement, Cf. semantically unrelated far, forward, and fare “payments”, Cf. A.-Sax. faran “to journey”, fara “travelling companion, comrade”, etc., with germane derivative verbs and nouns for tickets and food of the Türkic verb bar-, q.v. The IE etymology skips on the “state of affairs” side and instead dwells on the unrelated ticket-travel side. In the end, the first homonym is left with no etymology, and the second homonym receives an imposter. The conflation grew to a degree where some neologisms can't be attributed to one or the other, like fanfare. The others appear nothing to do with either one: fieldfare, fellfare. Both Türkic verbs relay some type of motion, the initial distinctions did not survive internalization by numerous diverse vernaculars, leading to phonetic and semantic crossovers, Cf. Skt. parayati “carry” fr. ber- “carry” vs. Av. peretush “passage, ford, bridge” and Lat. porta “gate, door” fr. bar- “go”. The paradigmatic transfer of phonetic and semantic patterns of the duo attests to the genetic connections between these lines of the siblings from the Türkic phylum. See bear, depart, port.

English fart (v., n.) “expel intestinal gases through anus, flatus ventris” ~ Türkic burt-, burut- (v.) “expel intestinal gases through anus, smell badly”. Ultimately a verbal denoun derivative fr. a noun/verb base bu/buru/pu “vapor, steam”, see cloud, mist. Cognates: A.-Sax. feortan, OHG ferzan, ONorse freta, Dan prut; Rum. bășina; Balt.-Sl. (Lith.) perdzu, (Russ.) perdet; Gk. perdein; Skt. pard; Fin. pieru, Est. pieru; Punjabi pharata; Ch. pi (屁). The Lat. bombulum also belongs to this lexical cluster derived from some 4th-3rd mill. BC forms of bu, bur, and burut-. The IE etymology gives an empty “of imitative origin”, which does not work with the likes of the Gk. klania (κλανια), Lat. crepo “creak”, etc., nor with the Türkic synonyms ïčɣïn- “release”, yel, yil “eaten”, yoruɣ- “go, walk”, osur- “obviate”. The Türkic bu, bur is vapor, gas, hence the Sl. par (ïàð) “vapor, stream, gas”; the verb burt- is a derivative of the stem bu, bur, no need for any “imitative origin”. The suffix -t is applicable to verbs and nouns, thus burt can also be a verb and a noun. A burt- synonym osur- is a cousin of äsir “ether”, the A.-Sax. aeðm “air, breath, breathing, vapor, blast”. Phonetic modifications correspond to the linguistic families of the recipient. Gk. and Skt. have spread the word southwest and southeast in the 2nd mill. BC, the Kurgans took it to Central and Western Europe with the first (5th mill. BC) and the following waves. Together, the A.-Sax. feortan and aeðm, the Türkic burt- and osur-, and the English fart constitute a “farting” transfer paradigm, categorically attesting to the genetic connection of the three languages. The Türkic bu, bur is a spectacularly productive stem playing on a number of steam properties: boiling, bursting, whirling, burning, purging, and the like, it was especially productive for the terminology of the ordinary cooking. Ultimately, derivative cognates of bu, bur are principals like bake, boil, breath, bullion, fart, fire, fog, mist, murky, purge, pyre, (see highlighted entries) and trivia like burp, puke, and possibly turbulence (see turn and boil).

English feel (v., n.) “physical or emotional sensation” (Swadesh N/A, F174, 0.13%) ~ Türkic verb bil- “know, understand, perceive”, and “learn, discern, detect, probe, scout” . In both languages the underlying prime notion is “to know, perceive”. Cognates: A.-Sax. felan “feeling, perceive, touch” with a notion of “perceive through senses of not any special organ”, OSax. (gi)folian, OFris. fela, ONorse falma “grope”, Du. voelen, Dan., Norse føle, Sw. känna (calque), OHG vuolen, Gmn. fühlen, Icl. finnst, Yid. filn “to feel”; Lat. palpare “feel, touch”; Mong. bilig “knowledge”; Ch. ganjue (感觉) (calque). A.-Sax.  parallel word fredan and Sw., Ch. “feel, perceive” are formed out of the same notion “know”. A.-Sax. is derived of a native frod “wise, old”. The transition from a plosive labial to a fricative labial b > f is endemic to the Gmc. group. In Türkic languages, the notion feel fr. the notion know is expressed by four roots, bil-, sez-, tartil-, tuj-. The tartil- is an extension of tart- “pull, guide”. The Türkic developmental model is common to Gmc. and Sl. languages, the Sl. chuvstvovat (÷óâñòâîâàòü) “feel” is a derivative of chu- (÷ó-) “to get wind of, smell, hear”, i.e. “know”. While the Lat. palpare ascends to the Türkic bil-, the Lat. sensum “to feel” is an offspring of the synonymous Türkic root sez- “to come to know, to learn, to feel”. The IE etymology suggests “perhaps ultimately imitative”, essentially resigning to a no meaningful etymology. The English verb feel “feeling (v.)”, a progeny of the A.-Sax. lexis, is a 15th c. articulation of the Türkic bil-. The Gmc. forms, consistent in transitions b-/v-/f- vs. the Italic b-/p- transition, point to independent paths and timing.  Italic b-/p- transition follows a trail of Türkic vernaculars, it has a special diagnostic value. Another utilization of the root bil- with the semantics “able” found employment as a suffix -able in English innovations like suitable and doable, the last a compound of two reflexes, the “do” and bil- “able”. The Eng. forms conflated with the Lat. Turkisms of the same word bil- “able”, the Lat. -abilis, -ibilis, the Eng. -able, -ible. A paradigmatic transfer of two discrete meanings of the same root attest to to the common genetic origin from the Türkic milieu. See feeling.

English find (v.) “discover” (Sw N/A, F156, 0.11%) ~ Türkic ind-, yind- (v.) “find”. Ultimately fr. a verb ir-/er-, is-/es- (s/r alternation) “be, pursuit” with a directional suffix -da/-ta forming a notion “from/to, pursue”, a term probably from the hunter-gatherer days, Cf. Eng. are, is. Form ind- has a counterpart yind- with a prosthetic anlaut consonant, consistent with the Oguz (ind-)/Ogur (yind-) divide. The Ogur form is associated with western Türkic phylum, particularly with the Sarmat area of the northern European antiquity. In all Gmc. forms the Tr. semi-consonant anlaut y- is replaced with voiced or unvoiced labial fricative v-/f-. The same process is observed in the Sl. Turkisms. Cognates: A.-Sax., OSax. find(an), OFris. finda, ONorse finna, MDu. vinden, OHG findan, Gmn. finden, Goth. finþan, all “find”, all very consistent with modern English form find. The -d-/-t-/-þ (th) alternation of the Gmc. languages is consistent with -d-/-t-/-δ- renditions by the European Türkic dictionaries. The IE etymology offers a line-up of semantically unrelated to the notion “find, discover, search, seek” and fairly remote phonetic homophones: “tread, go”, “pedestrian”, “path, way”, “open sea”, “tread, walk”, “bridge”, “heel” somehow related to a dreamed-up unattested PIE root *pent- assembled per Türkic syntax of a root pen- and a deverbal noun suffix -t-, see port. To make things even sillier, the IE etymology does not cite a single word meaning “find” outside of Gmc. group, demonstrating that the word does not belong to the IE family at all. The phonetics, consistent with the Ogur/Oguz alternation, is quite perfect, and the semantics is unambiguously precise, leaving no room for doubts about the origin of the Gmc. word from the Türkic scene.

English gabble, gibber, jabber (v.) “speak chattering, blabbering” ~ Türkic gep-, gap-, gapir- (v.), “speak, conversation” (Gadjieva N.Z, Koklyanova A.A., 1961, p. 365). A conveyed base notion is “to master a technique of speech”. It carries an element of quality, or a judgment, or a shade of conditionality. Like in Eng. “my child has learned to say (söy/söyle)” is an improper form of “my child learned to talk (til/tili)”, so in Türkic instead of improper söy/söyle or til/tili is proper to use gap-/gapir- “my child learned to gabble”. Although overlapping and interchangeable to some degree, each one conveys its own spectrum of very basic notions. The presence of derivatives formed with known, long unproductive archaic Türkic participle suffixes gen-, ges- and others demonstrate that the derivatives have formed when the relict suffixes were still productive, long before the advent of Islam with its Arabisms and Persianisms. The spread, morphology, and the phonetic and semantic variation within the Türkic linguistic family attest to its innate Türkic origin, and that affirms that the Arabic qavl- “word, expression” and the Pers. gep- “boastful speech” are concrete and probably late loanwords. The deverbal noun form gibberish, syntactically spelled in a reciprocal-cooperative aspect form giperiš (giperish) “chat, chitchat”, is indistinguishable from neither the Eng. nor the Türkic original. The attested phonetic variation of the allophones gibber/jabber is consistent with the common Oguz/Ogur phonetic variation of the anlaut consonant, and with an imperative inflection of the verbal stem gib- > gibber. In a feat of paradigmatic transfer, Eng. possesses five main action words related to communication: call, cry, gabble, say, and tell, direct siblings of the Türkic qol-, qar-/čar-, gap-/gapir-, söy-/söyle-, and til-/tili-. In the case of the gap-/gapir-, paradigmatic transfer is layered with a separate and independent morphological aspect. The IE etymology provides no parallels, and suggests origin “imitative of the sound of chatter, probably influenced by jabber”, without specifying in what linguistic family that orphic onomatopoetic form could have evolved. Across the whole dictionary, from gibberellic to giblets, Eng. does not have anything that could sound chattering or impress a sane person as a chatter. The textbook etymology is rather incompetent: the science of etymology is concerned not with gloomy enlightening news of an echoic origin. It is concerned with where this echoic origin came from, and what paths it took to spread. See call, cry, tale, tally, tell, say.

English gaggle (v., n.) “goose talk” ~ Türkic qaɣ quɣ (v., n.) “goose talk”. Cognates: MEng. gaggle, ONorse gagl, Du. gagelen, OIcl. gagl, MHG gagen; Sl. gogotat; Balt. (Lith.) gageti, gagu, Ltv. gagat; Mong. gogotat; Kor. geowitte (거위떼); all referring to geese and by extension to female talk. The IE etymology offers inconceivable “one of the many artificial terms invented in the 15th c.”, an etymology even less convincing than that “of unknown origin”, given that the cognates are spread across N. Europe to Mongolia/Korea, congregate in area demarcated by particular historical distinctions, and except Korea marked by substantial presence of the Kurganians' R1a/b haplogroups. The notation “possibly of imitative origin” for gaggle is not any better, the question is not the dimly enlightened news of the echoic origin, but where this echoic origin came from, and where it spread to.

English gaze (v., n.) “stare” ~ Türkic gör-/kör- (v.) “see, gaze”, göz/köz “eye”, Cf. közin körüp “gaze with eyes”. A third member of the homonymic verbal triplet is giz-, gez-, kez- (v.) “walk, wander, roam, travel, i.e. onlooker, observer”.. The unity of the forms köz/kör- “eye/see” is beyond doubt. A meandering of the Oguz and Ogur speakers during historical period is well established, the extensive pre-historical migrations have been reliably detected and genetically traced. The s/r divide settled the -z form as a noun and the -r form as a verb, with some mystery of detail. The adjectival version “agaze” combines the Türkic göz/köz with an A.-Sax./Gmc. prefix a- to mark the verb as momentary, a single event, Cf. abide, arise, awake, ashamed. A Sl. version prefix za- serves the same function, Cf. zaglyanut (çàãëÿíóòü) “glance in”. Cognates: A.-Sax. gor(ian), goret(tan), goret(tung) (with -r) “gaze, stare”, caes, ceas, ces, cys, ceos(an) (with -s) “seek, choose”, Goth. geisnan “scared, aghast” (us-geisnan ditto), Norse, Sw. gasa (with -s) “stare”, Fris., Gmn. kieze “choose”, Du. kiezen “seek”; Ir. cas “see”, Welsh ceis(iwch) “seek”; Hu. keresni “choose”; Turkish gözünü (with -s) “to eye (look)”. No IE etymology whatsoever, the ubiquity of the homonyms in most of the Earth languages notwithstanding. The semantic extension from “see” to “seek” and “choose” was predicated by the ingrained semantics of the native word for “see” in the receptor languages, Cf. 20+ semantic extensions for the “see” in English. The semantics “to gaze” of the Türkic gezer/gizer/kezer is quite natural, especially so since the prime meaning of the verb gez-/giz-/kez- is “travel, wander”, nowadays called sightseeing and tourism. The prime notion of the word giz shifted to “wonderer”, its kinship with the göz/gör- “seer/gaze” is retained as a metaphorical expression. Ligatures -ae-, -ea-, -eo- and the like reflect attempts to express rounded vowel with means of the Roman alphabet, a millenniums-long struggle, it makes English spelling a curious wonder. The modern spotty coexistence of dichtonomy köz and kör among Türkic languages attests to a stability of the phenomenon across millenniums. Particularly, it passed the melting pot of the Corded Ware culture of the 3rd mill. BC exemplified by dichtonomy across Gmc. languages. The Celtic examples suggest a primacy of the -s- form, at least in the E. Europe prior to the Celtic circum-Mediterranean migration. The Eurasian spread, largely limited to the Türkic and Gmc. families, attests that host populations along migratory paths had firmly established terms for “eye” and “see”. The carryover of the duplex köz/kör- constitutes a case of paradigmatic transfer, indelibly attesting to the common origin. The Türkic origin is preeminent, there are no other contenders. See gist.

English get (v.) “obtain, receive”, getter (n.) “who gets, gains, obtains, acquires, begets, procreates” (Sw N/A, F42, 1.03%) ~ Türkic get-, göt-, ket-, köt- (v.) “bring, carry, deliver”. Ultimately fr. the verbal root gel-/kel- “come”, Cf. Kelt/Celt (n.) “newcomer, comer, arriver” formed with an abstract noun suffix -t, see port. The causative form geltir assimilated to getir ~ ötir, modern Turkish infinitive form götmek “bring, carry”, lit. “bring make”. That etymological observation has already been made in early Turkology, laid out or contributed by J. Zenker (1866), L. Budagov (1871), H. Vambery (1878), W. Radloff (1910), M. Räsänen (1969), (all cited by Sevortyan 1980, vol. 3, p. 31). Whether original or assimilation, the existence of the root get-/göt- “bring, carry, deliver” is firmly established. A counterpart to the positive form göt- is a negative git- (G. Clauson EDT ke:ter-) “leave, disappear” with about the same extent of applications. It is also confusable with kel- “come”, (q.v., and G. Clauson EDT pp. 716, 706, 705, etc.). The word has some rich and fuzzy semantics from “fetch” to “translate”, overlapping with Eng. in prime meanings. It lacks the universality of the Eng. word as auxiliary plus about 36 meanings. In English, the semantics of the word expanded from a generic “receive, obtain, bring” to include a narrow “get (understand) the idea”, indicating semantic assimilation by alien speakers. Cognates: A.-Sax. giet-, gotten, gat, gitsian “attain”,(be)gietan, (be)geotan “get, find, acquire, attain, receive, take, seize, happen”, gitsian “attain”, (for)gietan, (for)git- “forget”, almost exclusively in compounds, numerous derivatives connected with “greed”, ONorse geta, gatum, very broad meaning, OSw. gissa “to guess (metaphoric)”, Goth. gatilon “attain, obtain”, gatiuhan “bring, take, draw, lead”, Dan. gæt, Gmn. geitzen; Hu. gondolom “I get it”; Kor. gat-ayo 같아요 “I get it”. The word has survived in the Balkans as a trading term on a wholesale supply of  delivered goods: Serb. djuture, Rum. gjoture, Alb. kuturu, Rus. gurt, gurtom (ãóðò, ãóðòîì), Cf. Turkish götürü. Suggested further cognates of the type ges- “guess” are inapplicable, they exploit a colloquial semantic extension of the get to shift from direct “obtain, receive” to metaphoric “get it (idea, news, etc.)”. There is a paucity of cognates, they are concentrated in peculiar Gmc. and Balkan points in peculiar geographic areas, attesting to a non-IE origin. That makes a suggested “PIE proto-word” *ghend- an unsubstantiated fiction. The “PG proto-word” *getan is a childish calque of the attested Gmc. forms, and by extension a calque of the attested Türkic forms (-an is a typical Gmc. verbal suffix), it is shared exclusively with Türkic languages. A phonetic aspect is stated by Clauson 1972 on the -d-/-t- to -y- transition within certain group of Türkic languages, with -y- presumably a later form. That observation is based on the review of the forms, mainly derivatives like kadir- and semantics that survived into the pre-13th c. That process pertain to the Gmc. languages, it demonstrates overlapping zonal furcation in distribution into gei and get- areas, the more western get- and more eastern gei. See God, give.

English gird (girt) (v.) “put a waistband on or around”, girdle (n.) “sash, waistcloth” ~ Türkic qur- (v.) “arrange, build, line up, gather, stretch”, qur (n.) “sash, belt”. The stem forms 16+ semantic clusters extending from girdle to garden and far beyond, it includes celebrities Kurgan, Kur/Gur/Chur (royal title), and a long trail of mundane vocabulary. As a “girdle” and “garden” with derivatives, it shows up in Rus./Sl., Pers., Caucasian, Mongolian, and across Europe. A complimentary qür-/gür- extends vocabulary and emphasizes some words. Cognates: A.-Sax. (OE) gyrdan “put a waistband, belt on or around; encircle, surround”, geard “hedge, enclosure”; ONorse gyrða, OSax. gurdian, OFris. gerda, Du. gorden, OHG gurtan, Goth. gards, garths, OIcl. gerði, Gmn. gürten; Welsh gwregysa; all Gmc. and Sl. cognates of garden, court, yard, and curtain; Sl. grad (ãðàä), gorod (ãîðîä), gorodit (ãîðîäèòü) (v.) “enclose, surround, enclosed, surrounded”; Chuv. karta “fence”; Alb. garth; Balt. (Lith.), OSax. “enclosure”; Gk. korthílai (κορθίλαι); cognates in OHindi grhas “house”, ghira “encircle”, Av. gǝrǝδo “cave”, Arsi oasis (aka Tokhar B) kerciye “palace”, Phryg. gord, all related to “enclosure”; Türk. derivatives and allophones yarïndaq, qur, qursaɣ “sash, belt”, kurgan “built tumulus”, qurla- (Cf. garland), qursa- “to girdle”, qursaɣïl “be surrounded, enclosed”, etc. The A.-Sax. begyrdan “to gird, clothe” is lit. a compound of 3 Türkic lexemes: be (bol) + gird (qur) + dan (don) ~ “to don a gird”. The widely shared and still active agglutinated Türkic suffix -t/-d forms abstract nouns. The distribution of the allophones and derivatives points to an origin of the verb qur- to at least 5th mill BC (Celtic in Europe 2800 BC, Celtic departure from N.Pontic at or before 5th mill BC, migration to Indian Peninsula 1500 BC, Phrygian migration 1200 BC), and accordingly dates the Türkic abstract noun suffix -t to at least 5th mill BC, i.e. to the early stages of the nomadic expansions to Europe. The gird and belt are parallel constructions from their respective stems qur-/gur and bel “waist” respectively. Together, they present a paradigm transfer case attesting to the inheritance from the Türkic phylum. See belt, court, curtain, garden, garland, guard, yard.

English give (v.) “gift, grant, bestow, confer, lend, impart, present, hand over, pass” (Sw N/A, F145, 0.16%) ~ Türkic qïv-, qïw-, kïv- “give, bestow”, synonymous with bağıš- (/baish/ with silent -ğ-) “bestow, confer, present” and ber- “give, hand over, grant”. The notion “give (blessing)” reflected in the Lat. calque annuit “nod assent, favor, bless, anoint” is imprinted on the USA currency. The literal notion “give” had to appear before the sacral notion “give blessing”, which can be timed to the first signs of Tengriism attested by the first cairns, about 6th mill. BC. Some Türkic Sprachbunds sacramentalized the word qïv- “give” and replaced it with lay metaphoric meanings of other words (Cf. ber- “bear”) Other Sprachbunds (Cf, Gmc.) continued its use in its lit. sense. The Türkic trifecta of terms constitutes a hierarchy, qïv- “bless, confer”, bağıš- “bestow, grant”, ber- “give, bring, bear”, with qïv- most dignified and ber- most mundane. The qïv-, kïv- is used in religious contexts as “bestow a fortune”, a semantic relationship akin the English usage of sg. you vs. thou. English has preserved allophones of all three words, give, bestow, and bear, with some change in semantic accents (Cf. dignified bestow and mundane give, the bear is more “carry” and less “give”). For the etymology of kiv-, G. Clauson waded through a series of cognates and analogues, in both sacral and secular fields, only incidentally referring to the embarrassing word qïv- “give”. The OTD translates qïv as noun  “fortune, fate”, but immediately cites expression qïvčaq (v.) qovï (n.), lit. “given (bad) fate”. The form qïvčaq positively attests that qïv- is a verb. Qïvčaq is a grammatical verb with exclusively deverbal adjectival suffix -čaq. Qovï is a grammatical noun with possessive 3rd pers. sing. and pl. suffix : “given, bestowed (deverbal adj.) (bad) fated” (denoun adj.). The expression lit. means “unfortunate” ( adj.), i.e. “(his, their) bestowed (bad) fate”. The same is attested by the expression qutadmaq qïvadmaq “be, become fortunate”, where qutadmaq (n.) is “fortune” and qïvadmaq (v.) is “given”. Or expression qutluɣ qïvlïɣ “lucky, fortunate” where adjectival suffixes -luɣ, -lïɣ indicate possession of property: “lucky” + “given”. The word ïv “give” probably existed long before the blurry primitive etiological figments crystallized into a harmonious system of Tengriism, which gave it a sacral connotation of “bless, send fate, give fortune”. That premise correlates with the first Tengrian cairns and kurgans of the Middle Volga, Samara, and Khvalynsk archaeological cultures dated by 8-6 thousand years ago. An A.-Sax. deverbal noun giefu, giefe (n.) “gift”, giefe, giefes (v.) “grace, favor” carried a sacral underpinning of the verb qïv-, kïv- “give, bestow (grace, blessing, supreme favor)” documented in the sources (formulaic kïv kut “give grace, fortune”, see God). That is a case of paradigmatic transfer (See gift). In the eastern languages, the verb kiv- historically had a notion of expressly gracious form “bestow” vs. the mundane “give”. It can be asserted that the initial semantics of the qïv-, kïv- was a mundane “give” that in the milieu of the eastern languages shifted to sacred connotations, similarly to a use of archaic oration in other religions, starting with Sumerian. Traces of the gracious notion are clearly seen in the archaic Gmc.: “give, bestow; allot, grant; commit, devote, entrust”. In the Gmc. languages “give” did not carry over a sacral connotation, attesting to its origin in the western Türkic sphere, outside of the eastern Türkic languages. Cognates: A.-Sax. geaf, giefan “give, bestow”, WSax. gief, OE giefan, OFris. jeva, ONorse gefa, ODan givæ, OHG geban, Goth. giban, MDu. gheven, ME yiven, Du. geven, Gmn. geben. Besides Türkic languages, of all the IE languages, cognates are cited only in the Gmc. group. That is profoundly misleading (otherwise called falsification or machination), Cf. expression: Eng. give me, Gmn. gib mir, Tk. kiver min, Tur. kiv bana (m/b alternation), Az. mənə kivmək, vs. “bestow, confer or present” of the Tr. bağıšlamaq, bağıšlamak. Suffixes -an/-en, documented in Gmc. forms, are reflexes of the Türkic verbal suffix -an, -än (-n, -ïn, -in; -un, -ün) marking a verbal voice, in these examples an active voice. A mechanically derived from the Gmc. forms fictitious PIE surrogate *ghabh- is brazenly expanded to the whole IE family contrary to all contraindications. Etymologically that is unsustainable. In the past, the origin of the word give remained conjectural at best, defying all attempts to find it. The word give is a member of paradigmatic transfer of a large group of Türkic sacral terminology that attests to its common origin from the Türkic linguistic community. A presence of these words in English is a clear attestation of a genetic inheritance, a significant evidence on the linguistic scales. See bear, bestow, gift, God.

English go (v.) “move” (Sw N/A, F57, 1.08%) ~ Türkic git-/kit-/ket- (ked, keδ-, kej-) (v.) “go”, with extensions “leave, depart, pass, disappear, die, refrain”, and auxiliary verb of action. It is nearly synonymous with it-/yit- “go, go in front, lead”, see itinerary. Both words were appropriated into “IE” arsenal. In modern English, past tenses of be (< Türkic var- (v.) “to be”) and go (< Türkic maŋ-, maŋla-/baŋ-, baŋla- (v.) “stepp(ing), stride”, m/b alternation) come from entirely different verbs. Cognates: A.-Sax. gan “go, advance, depart, happen, conquer”, witan “go” (> went), OE gaed (past tense), OSw., OFris. gan, MDu. gaen, Du. gaan, OHG gan, Gmn. gehen; Fin. kiita “hurry”; Mong (Kalm.) ketrxä “go by”. The synonymous it-/yit- has produced synonymous OIr. eth-, ethaim “go, I go”, Ir. (bo)(i)thar “(cow) road”, Gaul. eimu “we go”; Goth. iddja; Balt. (Lith., Latv.)  eiti/eimi/iet/eimu/iemu/eit/eisei; Sl. idya/ida/iti/idu/isi/issti; Lat. iter “way”, Skt., OHindi eti “goes”, Skt. jihite “leave”, Av. eiti; OPers. aitiy, Gk. adeia (αδεια). Distribution covers the width of the Eurasia, crossing linguistic barriers to where IE foot did not trod. Positively from non-IE origin. Goth. forms are eidetic with Sl. and Balt. forms. The Türkic prosthetic anlaut consonant g-/k- points to Ogur form it > git, from which developed Gmn., Skt. and Chinese 去 (shi) forms. The Türkic auslaut verbal suffix -t is is agglutinative marker found in most of the dialectal forms, related to grammatical person and tense in verbs and forming deverbal nouns. The Goth., Skt., OIndian, Av., OPers., and Gk. forms point to an alternation i-/e-. The form baŋ- is idiosyncratic because its many CT homophones are confusing and unsustainable in a long run; that makes it a potential diagnostic trait. Both Eng. and Tr. have preserved a semantics of intention: “I am going to do something”. The PIE faux *ghe- “to release, let go” belongs to a hypothetical ether theory, peculiarly mimicking the Türkic attested forms. The uniformity of forms across families (Celtic, Türkic, Skt., Gk., Fin., Mong. forms) points to pre-2000 BC N.Pontic origin. The Celtic agglutinated forms follow the Türkic grammar and attest to the existence of the word prior to the Celtic circum-Mediterranean anabasis of the 5th-4th mill. BC. The Chinese word is likely a reflex of the Scythian Zhou component in the modern Chinese language. The peculiar uniformity across Eurasia rules out the origin from any of the PIE hypothetical kernel locations, leaving the Türkic nomads as agents of dissemination. Paradigmatic transfer of the synonymic words for go and be across languages vividly attests to a common source from the Türkic linguistic milieu. See itinerary, was.

English gore (v.) “wound by piercing” ~ Türkic göreš- (v.) “penetrate, fight, kill”, güreš- (v.) “kill” (Robbeets M., 2004, #62). Ultimately fr. a prehistoric adj. kür/gür “mature, superior, eminent, brave” widely distributed from primeval times across numerous languages. The overlapping göreš-/güreš- and küräš- “encounter, fight, wrestle, war” are connected by invisible ties (Cf. A.-Sax. garraes “battle”), and with kür/gür “courageous, and with title “Kur/Gur/Chur”, which probably all come from the same ultimate origin, settled in different languages, and relay sibling notions in various phonetic alterations. Cognates: A.-Sax., A.-Sax. gar “spear, dart”, garcwealm “death by spear”, Du. geronnen, Norse gørr, Gmn. Gehren, Scottish gorren “pierce, stab”, Welsh gornio; Lat. cruore, Sp., Fr., It., encornar “be gored”; Gr. gkor (γκορ); Arm. aryun (արյուն), Guj. gora (ગોર); Balto-Sl. (Lith.) kraujas, (Ru, Ukr.) kol-; Finn. hurme, Hu. har-; Georg. gori (გორი); Heb. ngoah (לִנְגוֹחַ); Azeri koy; Ch. geer (戈爾); Jap. goa (ゴア) “gored”, korosu “kill, murder”; Kor. go-eo (고어). Etymology is “origin unknown”, positively no IE etymology. This most ancient word is one of the early infiltrators into European languages. The amazing Eurasian distribution covers numerous linguistic families indicating widespread borrowing, and a true mobile source not replicated till the Information Age. Cognates are found in nearly every language in the Eurasia from Pacific to Atlantic, save for relatively few and remote from the Eurasian steppe belt. The phonetic homophony is striking, and semantic match is perfect. Nearly every language has developed a nest of derivatives connected with wound, blood, piercing, kill, and many more. The Türkic languages developed numerous allophones: göres-, güläs-, güres-, keris-, küres-, etc. The göres- pierce must be one of the most ancient Türkic lexemes introduced into European languages ca 4800 BC via circum-Mediterranean Kurgan route of the people marked by R1b Y-DNA marker, and ca 2,500-3,000 BC Turkification of Europe and South-Eastern Europe by the same Kurgan R1b people via the overland routes. Its presence in the Celtic group attests to its presence in the Eastern Europe at 6th-5th mill. BC, way before 2-3 mill. later the first Kurgan waves ventured to the Central Europe. The distribution traces the spread of the Celtic people from Iberia, their incursion to the Apennines, and an independent overland path from the steppe belt to the Baltic and on toward Albion, a distinct path of the Kurgan people. The Türkic lexeme could have been introduced to the Far East by the Xia nomads reported in the Chinese annals ca 2300, and by the Zhou “Scythians” Kurgans migration of 20th – 16th cc. BC. That attest to the Türkic cultural influence in forming the Chinese nation and language, and to blending with people that later came to be Koreans and Japanese. Its appearance in the Heb. apparently belongs to the time when in the 7th c. BC, Scythians dominated Palestine. It is one of the few Türkic non-religious terms found in Hebrew. Its appearance in the Gujarati corroborates Gujarati claim to the origin from the Ephthalite Huns. The singular -r/-l alternation of liquids, characteristic to Chuv. but not to the Sl. group, and absent in the Lith. form, points to a separate Sarmatian path datable to the 2nd c. BC or to the Western Hunnic time. The distribution of the word, from Pacific to Atlantic, makes the etymological association with the Kurgan people and Scytho-Sarmatians unavoidable. Distribution of cognates repeats the pattern of other words with etymologies “of uncertain” and “of unknown origin”. Obviously, all these forms were inherited via numerous independent paths from diverse Türkic vernaculars, creating a spectrum of allophones. The spatial distribution of the allophones and derivatives points to an origin of the verb göres- to at least 5th mill BC (Celtic in Europe 2800 BC, departure from N.Pontic before 4th mill BC). Speculation on immediate connection of gore- and quarrel is not warranted, since quarrel is an allophone of a different Türkic word (qaršï) from a separate Türkic root. See courage, quarrel.

English grind (v.) “fragment, abrade, scour, scrape” ~ Türkic qïr- (v.) “scratch, scrape, scrub”, reflexive form qïrïn- (v.), passive deverbal noun (Turkm.) ğırındı, Khak. kırındı: “crumbs, small fragments”. Elision of two vowels shows that the word was internalized via a furnace of heavily consonantal language with aversion to “extra” vowels and without appreciation that -ındı were specific morphological modifiers. Cognates: A.-Sax. grindan “grate, scrape”, cearclan (with k-) “gnash”, cearcetung (with k-) “gnashing, grinding”, forgrindan “grind down”, etc., Du. grenden “grate, scrape”; Lat. frendere “gnash ”; Balto-Sl. (Lith.) grendu “scrape, scratch”. The Lat. version is so distinct from the Gmc., Balto-Sl., and modern Tr. forms that apparently Lat. received its word via a different route. The word is an oddball within the IE family, an overt loanword. Attempts to invent some common PIE “proto-word” are misleading and misguided. The phonetic and semantic continuity between the Türkic and European allophones stands beyond any doubts. A paradigmatic transfer of a related cluster grind, curt, short attests to a common genetic origin from the Türkic milieu. See curt, short.

English guess (v., n.) “suppose” (Sw N/A, F214, 0.07%) ~ Türkic us- (v., n.) “think, suppose”. The form us is one of the series us/hus/hos/is/üs/ug/uq/ög/ök with related and at times overlapping semantics “wit, mind, smart, percept, think, suppose” etc. It raises a consideration that once it was a single-vowel word u/ö. It developed in closely related vernaculars, attaining synonymous modifiers -s/-g in allophonic forms. In Sprachbund environments each form blossomed into clusters of derivatives that retained initial root form and developed into a bouquet of fraternal genetic lines. Where they underwent a sequence of Sprachbunds' amalgamations, some descendent Sprachbunds retained bouquets of articulations and semantics. To a certain degree English is a linguistic island preserve filled with joyful zombies. The word us-/guess is a member of the intellectual complex transferred to English as a paradigm: ö-, san-, saq-, and us- which made the Eng. awe, sanity, sagacity, and guess respectively, see awe, sagacity, sanity. Guess is an allophone of us- “think, suppose” in an Ogur form with a prosthetic anlaut g-/ɣ-, attested by Romanized pronunciation hus-/hos- and spelled e.g. gus- and hus-. Notably, the Ogur prosthetic anlaut consonant has not always survived, it was supplanted by the Oguz non-prosthetized form beginning with vowel (Cf. A.-Sax. geong “young” vs. Eng. young and Tr. yangi:, see young). Cognates: A.-Sax. not recorded, ONorse geta, MDu. gessen, MDan. gitse, getze, MLG gissen, all “guess”; Welsh gesio. The Arab. form ğuš, ğušiya “not to understand, to be at a loss” closely parallels the Tr. compound idiom (g)usi az- “guess”, in both cases lit. “understand little”. Distribution is peculiar, across the width of Eurasia with confined outlier islets. The guess obviously originated in non-IE languages. It is an oddity among the mass of the IE languages, and is peculiar only to a handful of specific languages. That positively rules out any “common PIE” invention. The IE etymological attempt to equate “guess” and “get”, and then use “get” to invent etymology for “guess” is ludicrous. If the MDan. inlaut -ts-/-tz- really spells out “guess”, these ligatures attempt to relay phonetics unrelated to the notion of “get”. The Eng. ligature -ue- apparently attempted to transmit a particular -u- sound otherwise conventionally spelled in English with -oo- (Cf. cook). The modern phonetics /ges/ is reconciliation between the customary pronunciation and the written form. The Arab. form with the prosthetic anlaut g- attest to the Oguric source for both languages. It is also a screaming testimony against the “common PIE” origin. Assertion that the word us is peculiar to the Türkic “purely Western word” (Clauson, p. 240), with a related form u:z “skilled, knowing” with a different quality of the vowel u- must be taken with a grain of salt. If the Huns could collect tribute from the Far Eastern China in the 2nd c. BC and then from the Mediterranean Rome in the 5th c. AD, they profoundly defy the Eastern/Western presumption. Allophone distribution rather resembles a pie chart, with sectors shaded by geography and colored by modern ethnic assemblies. The presence of the fossil R1a Y-DNA marker both in the Western Europe and in the Far East tells that the Hunnic western incursion of the 4th c. AD probably was neither a first nor the only one. The transfer of the entire intellectual paradigm, q.v, the phonetic and semantic match indelibly attest to the genetic connection with Türkic linguistic field. See awe, cook, sagacity, sanity, young.

English hack (v.) “chop, cut away”, hatchet “ax” ~ Türkic kes- (v.) “cut, divide”, kes (n.) “part, piece”. The indiscriminate noun/verb root attests to its antiquity, an yearly stage of the language. Türkic has numerous allophones and derivatives, and wide semantic range, attesting to deep antiquity and geographic spread: kes, ges, xas, kys, kesüü, kesu, kisü, kestir, keserge, kizerge, and more. The root kes has a sibling ker-, i.e. kert (v.) “chop, chop off”, kerki (n.) “axe, hatchet, hoe, dagger” with r/s alternation and its own trail of native forms and loans to Mong. (kerchi, hərchi “cut, chop, chop off”, hərchim “part, piece”, Kalm. kerchm “section, piece”), Tungus, Manchu, “Indo-German” (nowadays IE), Bulg., Alb., Pers. (boridan “cut”) and Kuchean parallels. The systemic r/s bifurcation of the root and derivatives, and complexed paradigmatic transfer to various host languages indelibly attest to a common genesis from the Türkic linguistic world, defying any claims to an IE origin and challenging the faux “IE reconstructions”. The differing morphologies of the siblings indicates that the branches did not mingle too much. Cognates: A.-Sax. haccian, OFris. hackia, ONorse höggva, OHG hacchon, Du. hakken, Gmn. hacken; OFr. hache “axe, pickaxe”, hachete “hatchet”; Ugric hache, Hu. hasit; Kuchean has- “cut”; Akkadian kes-, kas- “cut, chop, shorten”, käs- “chop”, the earliest record of 28-24 cc. BC; Akkadian in Arabic kasap, kısım, kasım, kısmet, etc. “cut”; Chuv. has (xas), Bashkir kicheü. The areal distribution of the European forms is confined to the central and northern belt of Europe, unrelated to Romance and Indo-Iranian branches with their own native terminology. The conundrum is clear, either Northern European, or Southern European (Lat. trucidabunt), or Indo-Iranian version (Pers. boridan, suspiciously Türkic-looking) can be held as IE, but not all three. The OFr. form with -ch vs. -s/-sh and vs. -k forms probably reflects a Middle Age Alan or Burgund source. The western Chuvash and Bashkir forms clearly are allophones of the kes with k/h/x and s/sh/ch transitions. These were the western Türkic branches, collectively called Sarmats, they carried the form hach/hack to the Northern Europe and to the Ugrian languages. The transition k/h is regular between the OT (Oguz) forms and Gmc./Eng. forms, e.g. gird ~ hilt. A Türkic origin is impeccable. See curt, cut, short.

English harm (v., n.) “damage, injury” ~ Türkic ur- (v.) “beat, strike”. The verb/noun root ur/wur relays 11 semantic clusters plus paired idioms. Its penetration into eastern languages has long been established. The Eng. and Gmc. allophones of ur- have acquired a prosthetic anlaut h- typical for the Gmc. and Sl. languages, A.-Sax., and Cockney, attesting to a probable path distinct from that for the word wound (< yon, prosthetic v-/w-), see wound. That hypothesis is supported by the presence of the ONFr. hurter “strike, collide”, in contrast to the concentrated distribution of the cognates for the harm and wound. The synonymy and consonance suggest that harm is an allophone of hurt that reached A.-Sax. together with wound but separately from hurt, see hurt, wound. The A.-Sax. hearm ends with with the auslaut -m, a relict of the Türkic/Celtic -im “me”, Cf. ursïqïm “(I am) beaten, stricken”. The same process with a 3rd pers. suffix -ti:/-di: would produce the form hurt (v.), and hurt (n.) with with a deverbal noun suffix -t, see port. Cognates: A.-Sax. hearm “harm, injury”, OE hurt (1200) “injure, wound”, OSax. harm, OFris. herm “insult; pain”, ONorse harmr “grief, sorrow”, OHG, Gmn. harm “harm, grief, sorrow”. No IE cognates. In support of the PIE dogma, the IE etymology had invented some ethereal PG and PIE “proto-words” for the harm, and separate PG and PIE “proto-words” for the wound, while allowing hurt to come from an “uncertain origin” which stands for “we don't have a clue”. A complete absence of the cognates in the Romance and IE Asian groups, an oddball loanword in ONFr., probably from an Alanic source. Notably, while Goth. does not appear to have a cognate for harm “damage, injury”, it does have numerous synonyms shared with A.-Sax., pointing to an older common Goth. - A.-Sax. linguistic base. Following discrete and unique paths for each member of the linguistic community, they amalgamated with Türkic languages. The ubiquity in the Gmc. group attests to, first, a distinct areal northern and southern Sprachbunds, and second, to the later internalization by the northern Gmc. group, after the Romance farmers bounced back to the south-central Europe ca. 1500 BC, and the Indo-Aryan farmers departed on their South Asian trek ca. 2000 BC. The possible time for the internalization appears to be the long period from the Corded Ware bloom to the Sarmatization of the North-Central Europe. The Türkic idiomatic compound ur- yon- combines two nearly synonymous words into an emphatic pair, of ur- “beat, strike” and yon- “wound, hew, cut”, translated as “beat, torment”, see allophonic forms of ur- hurt, hurl. A paradigmatic transfer of the cluster harm “damage, injury” and hurt, hurl ascending to a common root ur-, and the ur- “injure” and yon- “wound” separately, and ur- yon- paired in attested Türkic emphatic idiom, provide irrefutable evidence of the Türkic origin for these words. See hurt, hurl, wound.

English have (v.) “possess”, (n.) “possessor” (Sw N/A, F19, Σ1.25%) ~ Türkic kap- (qab-/qap-) (v.) “keep, seize, grab, hold”, ultimately from from kap (qab/qap) (n.) “container, vessel, box” (lit. interpretation) or a shape of the container contents taking a container shape (metaph. interpretation), like frozen water in a container, see cup. The verb verb has 7+ semantic clusters containing at least 30+ meanings, a prime meaning “grab”. That points to an origin from trapping hunt. The have is an allophone of keep, and is partially interchangeable with it, see keep. It is also a cognate of It. capisci, Sp. capiche “seize (an idea, a thought)”, Lat. capere “grasp, seize” of the Romance group. Those come from the semantic cluster “perceive, seize” of the same verbal root. Cognates: A.-Sax. habban, hafian “keep, have, guard” (also with prefix at-/aeth-, be-, n-, etc.), aethabban “retain”, nabban “not have”, ofhabban “held back”, OSax. hebbjan, OFris. habba, WFris. hawwe, Saterland Fris. hääbe, Du. hebben, Dan. have, ONorse hafa, Norw. Nynorsk ha, Sw. hava, Icl. hafa, LGmn. hebben, hewwen, Goth. haban “keep, have, possess, hold”, Gmn. haben; Lat. capio “take”; Rus. xapat (õàïàòü) “seize”; Alb kap (“grab, catch”; Fin. kaapata “capture”, Hu. kap “receive, get”; Mong. havshix “bite (fish)”, Kalm. xawl- “catch (ball), bite (fish)”; all “have” except as noted. The cognate line can keep going for a good while more, just the English only has 11 formalized forms for “have”. The word has been massaged by generations of Turkologists for more than a century, and there are still more to capture. Distribution is typical for Türkic loanwords: Eurasian steppe width with some far western and far eastern appendages, crossing linguistic barriers, with a caveat. In Türkic languages possession is formed by a choice of suffixes, most frequently -lig/-lïɣ, so a need for a separate verb “have” is minimal and it is usually formed by a denoun verbal expression “be” or “be owner”. A semantic re-purposing of the notion “hold” to “have”, and its use as an auxiliary verb with heavy syntactical load are uniquely European. It is limited to particular European areas and to the European vernaculars. Although different groups of vernaculars adopted different Türkic verbs, they are calques of the Gmc. “have”: Türkic taŋ /tang/ “tie” > Romance tengo, tener “have”, Türkic menin “mine, my” > Sl. moi, imet, Fin. minun. The internalized phonetic and semantic match was soundly established within the IE-centered etymology. It dead-ended at the neologistic “PIE reconstructions” with faux “PG” *habejanan and “PIE” *kh2pyeti, *keh2p-, endowed with Türkic abstract suffix -an and verbal suffix -ti. On top of that, the cognate keep was assigned another faux “PIE proto-word” *gab- “to look after”, and the Romance cognate capisci, capiche, capere was assigned a faux Romance “proto-word” *kapio. They all reek of myopia and quasi-scientific charlatanry. Thus no sane IE or any other etymology whatsoever. The omnipresent importance of the “have” in Gmc. languages is supreme, it fills 1.25% of the English daily lexicon, or every 80s word in a random English text. Too hard to bungle. A listing of paradigmatically transferred  kap-/qab-/qap- based words in English would number in dozens, offering inextricable evidence of the common genetic origin from the Türkic phylum. See cap, capture, cup, keep.

English haze (v.), hazing (v., n.) “subject to cruel horseplay” ~ Türkic häzl (n.) “joke”. Semantic indicates a trade word, tending to cloistered use like of military or sailors. Tradition of hazing must be older than a cavalry horse riding, the word may be as old as organized armies, but an absence of its cognate in Celtic languages questions its mounted coming to England in the 3rd mill. BC via circum-Mediterranean route. Absence of its cognate in Romance languages questions its coming to England via Scythian or Etruscan route. That leaves Cimmerians, Sarmats/Alans/Ases, Burgunds and Huns as possible source. The near perfect phonetic and perfect semantic concordance does not leave any room for doubts of its Türkic origin. The Old Türkic Dictionary (1969) gives only recorded form for the noun, but Türkic morphology allows the word to be a verb. Cognates: A.-Sax. heascan (heashan), hysan, hyscan (hyshan), hyxan “jeer, deride, taunt”, OE hawze “terrify, frighten, confound”; MFr. haser “irritate, annoy”; Ar. häzl “joke”. A.-Sax. numerous allophones indicate either unsure articulation or diverse sources for an alien word. Unlike A.-Sax. synonymous gliw “glee, joy” with its numerous derivatives, heascan has no derivatives, also indicating a concrete loanword. Distribution is peculiar: across Eurasia with spots along migration routs. Ar. as a source can be positively excluded for absence of prehistoric contact with Eng., it attests to a spread to the Near East and to crossing linguistic borders. The Fr. source may point specifically to the nomadic Alans or Burgunds. No IE etymology. Supposedly, English haze “horseplay” is “of unknown origin”, which is a euphemism for giving up on a habitual IE origin. That is contrary to the presence of the A.-Sax. evidence. The semantically distinct homophonic English haze “moisture, dust, or smoke” in OT is ïs “haze, fog, murk” with perfect semantics, which tentatively could allow etymological linkage with prosthetic h- and a shift of laryngeal vowel ï, but unless it can be demonstrated that some of 42+ Türkic languages have a form closer to haze, the phonetic link is too shaky. On the other hand the “unknown origin” of the English haze “fog” and consonance with a prosthetic h- leaves room to also suspect a Türkic origin. The stipulation that “The English differentiation of haze, mist, fog (and other dialectal words) is unmatched in other tongues, where the same word generally covers all three and often “cloud” as well” is brazenly false. Türkic has a matching allophonic stem muz for “mist” (see mist) and a matching allophonic stem bug for “fog” (see fog). Rather, the parallel presence of the stems ïs (hïs), muz, and bug attests to a case of paradigmatic transfer evincing a demographic convergence. It could be that some forms of Türkic häzl and ïs conflated either in English or on the road to English. See fog, mist.

English hide (v.) “conceal” (Sw N/A, F1128 Σ0.01%) ~ Türkic qod/qud (v.) “put aside, deposit, lodge”, (n.) “inside, bosom, hollow”; “hide” (v.) and “hide” (n.) are concrete semantic extensions. Ultimately a derivative fr. a root qo “drop off, complete, completeness, done, leave behind”, e.g. something finished and over. The Türkic term means “placed inside, into the bosom”, and with the noun-producing suffix -qa/-ɣa/-qa/-gä/-kä forms quyaɣ “armor, cuirass, shell” (lit. “inside the bosom”) and qudqa (n.) “hide, pelt, skin”. Acoustical modifications are at a high end of Türkic variability. The transitions g/k/q to h, and of semi-consonant y to th/t/d are consistent with similar observed linguistic modifications, they may be a result of assimilation into an alien linguistic system, or a local development within the Turkic family under an influence of an alien linguistic system. The primeval second consonant wobbles between -d/-δ/-n/-y/+. Cognates: A.-Sax. hydan, hedan, MDu, MLG huden “to hide”; Ir. cheilt/chur/chun “to hide”, Welsh chuddio/guddio/kuddio; Mong. qodi- “late”. Distribution is typical for Türkic lexicon, ubiquitous across the length of Eurasia, and spotty along the fringes. The IE etymology is feeble, it needs to be retracted. It addresses only selected samples aimed at predestination, and loosely resorts to random, unsuitable, and far-fetched cognates, like “eyebrows” and “clouds”, “hut” and “sky”, and “guts”, the later unwittingly echoing the Türkic for “inside, bosom”. The Celtic cognates carry a mark of a local Sprachbund, the Ir. chun (hun) reflects a form cited by G. Clauson (EDT p. 631). Celtic forms suggest two separate paths, one overland, and the other Celtic circum-Mediterranean. Observed alternations suggest an initial form kon, kod, and hint on dialectal split, bifurcated into western and eastern forms for Celtic (-l/-n/-r forms) and Gmc. (-d form). Mong. semantics ascends to the base semantics of qo-. See gut, hide (n).

English hit (v., n.) “afflict suddenly” (Sw N/A, F479, 0.02%) ~ Türkic at-, ït-, it-, jit- (yit-) (v.) “push, thrust, shoot”. The word has at least 10 semantic clusters. A prime notion of the verb is “throw”, it is cited in all semantic listings of the verb, along with secondary meanings like “hit”. The voiceless affricate jit- has voiced semantic and phonetic siblings chit- (Cf. cheat) and shit- (Cf. allophones shot, shoot). A blank for the A.-Sax. form shot likely was a more common in circulation form at-. Internalization of shot and shoot visibly came separately and by a different rout from the form hit. Cognates: A.-Sax. sceotan (with sh-) “hit, hurl, strike”, OE hyttan, hittan “hit”, ONorse, Sw. hitta, Norse, Dan. hit, hitte, Gmn. hauen; Icl. högg; Fr. heurter; Balt. (Lith.) atsitrenkti; Gk. epitychia/ktypi̱ma/chtypo̱/ktypo (επιτυχία/κτύπημα/χτυπω/κτυπω); Alb. qit “hit”; Kor. ath .“throw away”. The word has no IE etymology other than faux “reconstruction” hit = it “this one” that unwittingly hits on the prosthetic initialh-/k- stipulated above. Otherwise, in spite of the extensive Gk. forms, not even “of unknown origin” for the hit. The prosthetic h-/k-/ch- allows to suggest an Oguric origin, like Sarmat, Hunnic, and Bulgar for the northwestern European zone and Gk. forms, and Oguz for the Lith. form. A prosthetic consonant in the northwestern European cognates of the Türkic lexicon is consistent with numerous other cognate forms. The perfect semantic and phonetic match indeed makes a random phonetic coincidence between the numerous European languages and even more numerous Türkic languages impossible.

English howl (v.) (wail, yawl, yell, yowl) “shout, loud cry” ~ Türkic yel (n.) “wind, howl (demonic)”. For a raster of verbal notions, cognates, and development of the notion “howl/wail/yell” as a verbal derivative of the yel “wind”, see yell. A complimentary semantically related yaŋ (n.), yaŋra- (v.) “echo, loud reverberation” suggests that the initial root was ye-/ya- “loud sound, boom, roar, thunder”, and l- and ŋ- once were morphological discriminators. The Türkic verb ulï- “wail, moan, bellow” is also an allophonic derivative cognate of the noun and verb yel. Phonetic range is wider than usual, dictionary lists yel, yǝl, yil, yi:l, jiil; yïl, jel, djel, žel, žil, zel, del, chil, šel, sil, kül(ək), plus Pers., Mong., Tungus, Manchu versions. Those distinct forms afford valuable diagnostic opportunities. For the noun “wind”, A.-Sax. had adopted a peculiar IE form, an allophone ve- with A.-Sax. hallmark prosthetic anlaut v-/-w and -l/n- variation, consistent with numerous IE allophones, Cf. Lat. ventus, ululo “wind” “howl” vs. Gk. anemos, ourliazo. The European forms including English are transparent allophones of the Türkic prototypes yel and yaŋ. An oddball word turned to be a nice loupe into the IE origins. Cognates: A.-Sax. gyllan, OE giellan, Mercian gellan “yell”, allophonic cognates “gale”, “nightingale” from gaile “wind, song” fr. howl, claimed as “origin uncertain”; ONorse gjalla “reverberate”, gol “breeze”, galinn “bewitched”, OHG gellan, MDu. ghellen, Du. gillen, Gmn. gellen “yell”; Welsh weiddi, gweiddi “yell”; Balto-Sl. (Latv.) kliegt, (Lith.) kliegti “yell”; possibly Gk. kikhle (κίχλη) “songbird”; Hu. szel (sel) “wind”; Mong. salki (n) “wind”; Even. nələ “wind”. Distribution is typical for Eurasian Turkisms, from the Eurasian Steppe belt to the Eurasian fringes, across linguistic barriers. A claimed association with the Sl. golos (ãîëîñ) “voice”, a cognate of Eng. call and Türkic qol, is erroneous. The IE etymology did not recognize that “ululate, lull” and “howl/wail/yell/yawl/yowl” are allophones, and on purely phonetic grounds tried to seek different individual PIE precursors for this series with close phonetics and semantics. The paradigmatic transfer of both meanings “wind” and “howl” from the Türkic milieu is incontestable. See yell, ululate, lull.

English hurl (v., n.) “throw” ~ Türkic ur- (v.) “beat, strike, throw”. Much like the Türkic/English universal verb tu-/do- “do, act”, the verb ur- is universal and polysemantic, it carries a notion of sharp or short movement (i.e. throw an arrow, throw a glance, step on) or a directional action (i.e. extend a hand, give/bestow name, title, law), to name just two of many uses. For English hurl, the best example is a fully conjugational paired expression oq ur “to shoot an arrow”, where oq is “arrow”, and ur denotes the action of “throw, shoot”. The notion is an one-time event, like a strike, or it could be a series of one-time events. The affinity of the hurl and ur- is directly attested by the perfect semantic and close phonetic similarity, by systematic prosthetic anlaut h-, and by the attested Türkic idioms containing the verb ur- with the concrete meanings inherited by the A.-Sax. and OE (harm and hurt) and a complementary A.-Sax. wound. These are the oq ur q.v. and an emphatic pair ur- yon- “beat, torment”, see hurt, wound. Morphologically, hurl is formed with a poly-functional Türkic/English suffix -l, here a verbal suffix of passive voice and adjectival/adverbial suffix -l, Cf. “usual”. The cluster harm, hurl, and hurt is formed by internalizing into the roots the Türkic suffixes -m, -l, and -t respectively. The paradigmatic transfer of two roots, ur- and yon- separately, their attested compound in the Türkic emphatic idiom ur- yon-, and the line-up of the Türkic suffixes provide irrefutable evidence of the Türkic origin for the entire cluster inclusive of wound. See harm, hernia, hurt, ore, usual, wound.

English hurt (v., n.) “ache, harm” (Sw N/A, F310, 0.05%) ~ Türkic ur- (v.) “beat, strike”. The Eng. and Gmc. allophones of ur- have acquired a prosthetic anlaut h- typical for the Gmc. languages, A.-Sax., and Cockney, attesting to a probable distinct path from that for the wound, see wound. The final -t is probably a relict of the 3rd pers. verbal suffix di:, or the pl. suffix -ter/-der contracted to -d/-t, Cf. ONFr. form hurter “strike, collide”, or a deverbal noun suffix -t, see port. It also parallels the bifurcated articulation of the synonymous Türkic dialectal (Ottoman) yon-/yont- for an allophone wound, which comes etymologically as non-explainable bservation. The semantic raster of the Türkic cognates is far wider then that of the Gmc. cognates, attesting to a much wider dissemination of the word among the Türkic ethnic communities, and accordingly far deeper roots. Cognates: A.-Sax. hearm “harm, injury”, OE hurt (1200) “injure, wound”, ONFr. hurter “strike, collide”, MHG hurten “run at, collide”, ONorse hrutr “ram”, MDu. horten “knock, dash against”, Ir. dochar “hurt”, Scots Gaelic goirteachadh “hurt”, Welsh brifo (ur- > w/b-ur- > br-) “hurt”. The A.-Sax. hearm is a natural allophone of the hurt, with the auslaut -m a relict of the Türkic -im “me”, i.e. “(I am) beaten, stricken”, see harm. The ONorse moniker is clearly a semantic derivative. A complete absence of the cognates in the Romance and Asian IE groups, an obvious oddball loanword in ONFr., probably from an Alanic or Burgund source. No IE cognates whatsoever. The IE etymology stipulates “a word of uncertain origin”. In support of the PIE dogma, the IE etymology had invented some ethereal PG and PIE “proto-words” for the wound, and separate PG and PIE “proto-words” for the harm, while allowing hurt to come from an “uncertain origin” which is mysterious. The ubiquity in the Gmc. group attests to, first, a distinct areal northern and southern Sprachbunds, and second, to the later internalization by the northern Gmc. group, after the Romance farmers bounced back to the south-central Europe ca. 1500 BC, and the Indo-Aryan farmers departed on their South Asian trek ca. 2000 BC. The possible time for the internalization appears to be the a long period from the Corded Ware bloom to the Sarmatization of the North-Central Europe. The Türkic idiomatic compound ur- yon- combines two nearly synonymous expressions into an emphatic pair, of ur- “beat, strike” and yon- “hew, cut”, translated as “beat, torment”, see allophones of ur- harm, hurl. The Eng. and Gmc. allophones of ur- with prosthetic anlaut h- are attesting to a probable distinct path from that for the wound. That hypothesis is supported by the presence of the ONFr. hurter “strike, collide”, in contrast to the concentrated distribution of the cognates for the wound. The paradigmatic transfer of both words, ur- and yon- separately, and their attested compound in the Türkic emphatic idiom, provide irrefutable evidence of the Türkic origin for both words. See harm, hernia, hurl, wound.

English ignite (v.) “conflagrate, enkindle” ~ Türkic yak-, yaq- (v.) “ignite, burn”, a reflexive form of the verb ya:- “ignite shine, flame”, known from its numerous derivatives, and with parallel second forms yan- (v.) “ignited shine, flame” and yal- “blazed, burned, shined”. Normally the form yal- is intransitive, while the form yak-, yaq- is normally transitive, and can form a passive yakıl- “is ignited”. Cognates: A.-Sax. ael(an) “to ignite, kindle, light, set on fire, burn”, with numerous derivative forms: (on)ael(an) “set fire to, ignite, inflame, burn”, (on)ael(end) “incensor”, onaelet “lightning”; Lat. ign(ire) “set on fire”, a derivative of ign(is) “fire”; Balto-Sl. (Lith.) ugn(is), (OCS) ogni “fire”; Skt. agnih “fire”. The areal alternation of auslaut consonant -l/-n/-g expressing the same basic notion with different morphological elements attests to deep timeframe with attendant spatial differentiation, and different paths to the Central European A.-Sax. vs. Eastern European Balto-Sl., and Apennine Lat. Indo-Aryan Skt. forms a separate group. The deep time depth is also indicated by an accumulated collection of 7 active synonyms (ču:k-, kün-/gün-, ya:-, yak-, yal-, yan, yaru:-). Unlike the Tr. etymology that ascends to the original and attested base stem (ya:-), the IE etymology stops at unetymologized derivative (ign-), attesting to a loanword origin. Notably, the base stem ya:- is nearly a single-morpheme word, not found at the IE family. Looking at the variations of the base stem ya:-, ae-, i-, o-, a-, we can't tell how it was voiced in the initial period, with time depth and duration that can't be reconstructed. Probably none of modern phonetic graphemes can reflect the quality and variability of that vowel. We can be positive, however, that it was a primordial word at the root of the most widespread Eurasian designation for a start of a fire. The word öt (öd) “fire, flame” was its derivative. Eventually it grew into A.-Sax. ad, adwylm “whirlpool of fire” and Gk. adis (αδης) and ad (αδ, àä) for scorching “hell”. Both the primordial word and its “fire” derivative persevere through most of the Eurasia, form Atlantic to Pacific, and diffused way beyond their original Eurasian homeland.

Supplementary Note. Linguistic Model.

The Lat. word ign(is), ign(ire) and its Skt. form agnih, and other allophones in the IE languages, are held and cited as the fundamental proof of the IE linguistic paradigm. It is cited as uncontestable proof, and then etymologized as ascending to a bogus asterisked “IE proto-word”. A less myopic review disbands the incontestability of the argument, and thus seriously undermines the IE linguistic concept. If the IE linguistic theory had a definition of what constitutes the IE family, and that definition included the word ignite and its cognates, the definition would have to be re-defined. An absence of clear definition saves the the day for the Family Tree theory.

A switch to a concept of a series of areal Sprachbunds created by amalgamation of diverse local and migrant languages would resolve the uncounted controversies and theoretical chaff, and free scholars for explorations unburdened by patriotic prejudices.

English itch (v., n.) “pruritus, dermal irritation” ~ Türkic kiči:-/gici:- (kichi-/giji-) (v., n.) “itch”, and “scratch, tickle; lichens, scabies”. Cognates: A.-Sax. giccan, gicce, gicenes, gicða, (v., n.) “itch”, MDu. jöken, OHG jucchen, Gmn. jucken “itch”. No cognates outside of Gmc. and Tr. families. No IE etymology whatsoever. Distribution points to a Sarmatian origin: Vandals, Burgunds, Thurings, Alans, and the like “Wanderers”. The recorded OT sibling forms are kashı/qashï, the allophones closely match the oldest recorded English and Gmc. forms. Due to the flexibility of agglutination, each of the the Türkic four bases (emri-, êiči-, qašï-, qurtan-) potentially can serve as active and passive verb, and as active and passive noun. Eng. has lost that capacity and for a synonym has to turn to a foreign word used in medical trade, the Lat. pruritus. A fortuitous phonetic similarity between the root gici:-/gicce- and the root element -ich (bitch, glitch, pitch, witch, etc.) corresponds to the Tr. suffix -ich/-ish that forms abstract nouns. In A.-Sax., the double -cc- (giccan, gicce) apparently stands for -ch-/-č-, although at the beginning of the word -ch- is systematically depicted with a single c-. A phoneme depicted in Eng. with triplet -tch reflects the fricative -ch, probably with a hint of dialectal initial dental plosive -t- peculiar to A.-Sax. vernacular. The initial g- in the western forms (gi-, jö-, ju-) is consistent with the phonetic peculiarity of the Ogur languages. The survival of both forms (gicc etc. and itch) in the historical records indicates that they were brought over and used by at least two linguistically distinct groups. Probably, the Ogur form gi- was the first source, and the Oguz form i- supplanted it later. The phonetic match is perfect with correspondences regular, semantics perfectly identical, and both within a range of phonetic and semantic dispersion within the Türkic linguistic family.

English itinerate (v.) “go, travel” ~ Türkic ïd- (ïδ-, ïs-, ïz-, ïǯ-, ıy-, ı-) (v.) “send (to go)”. The Eng. “send” synonymically stands for “cause to go”. The Türkic verb ïd- is polysemous with the base connotation “go” and second connotation “intent”, retained in the Eng. idioms like “(I am) going (to go, to sleep, etc.)”, “(I was) going (to go to..., but...), etc.)”. Unlike Eng., the Türkic expressions largely imply but omit “go”, and include an intended action, like “send (him to go)”. Accordingly, translations also omit “go” and translate action, thus translation “I send a letter to (recipient)” tends to interpret the notion “send” as a meaning of the stem ïd-. Actually, it means “(I) send (a letter) to go to (the recipient)”. Aside from the common root iti-, the composition and history of the words “itinerate, itinerant, itinerary” are separate matters. Cognates: A.-Sax. at- (atgin conj. v. gin), adrran (intr.) “go”, Goth. iddja “went”; OIr. ethaim “(I) go” (Cf. Tr. idim, ıdayın ditto); Gaul. eimu “(we) go” (d-/y- transition); Balto-Sl. (Lith.) eiti “go”, (OCS) iti “go”, Bulg. ida “(I) go”, (Rus.) idti “go”; Lat. iter “travel”, ire “go”; Gk. eimi “(I) go”; Skt. e'ti “(he) goes”, imas “go (we)”, Av. ae'iti, OPers. aitiy “(he) goes”. Distribution of cognates corroborates both the broad historical picture of demographic movements in the western Eurasia, and the specialized disciplines of archeology and biology in respect to migratory flows, including those of the Kurgan Waves, Celtic, Indo-Aryan, and Sarmatian migrations in and out of Europe. The IE etymologists cite conjugations along with the roots, that allows to discern not only the roots, but identical conjugational agglutination. Cf. “(he) goes”: Tr. idti, Skt. e'ti, Av. ae'iti, OPers aitiy; “(I) go”: Tr. idim, OIr. ethaim, Gk. eimi. This degree of detail allows to trace grammatical metamorphoses of Tr. 3rd pers. perf. tense to Balto-Sl. infinitive form (suffix di/ti), or Tr. causative ide:r to Lat. iter “going”, and other transitions. Initially, the notion “go” was more literal, attested by the derivatives retained in the sister languages (Sl. idti, hodit (èäòè, õîäèòü, inf.) is an allophone of the Türkic idti (ıdtı) “send to go”, “made to go” (3rd pers. perf. tense, lit. “made (him, it) gone”). The Goth. cognate iddja “gone” (perf. tense) perfectly conveys the Türkic idti (ıdtı) semantically and grammatically. As a preposition (Goth. id-weitjan (v.), id-weit (n) “reproach”, lit. “going to approach”, OE ed-wit ditto, OHG. ita-wıč, ditto) it is an equivalent (calque) of the modern Eng. “going to”, where “go” expresses not a physical movement, but lit. or metaphorical “intent”. Notably, Goth. word precedes the LLat. word by better than half a millennia. That shows the direction of borrowings, and disbands the IE etymologies that ascribe the origin to the Lat. With a 2-phoneme single syllable word, chances of a random coincidence are fairly high, about 1/5 = 0.2 (phonetic coincidence 1/50, semantic coincidence 1/1000, 10,000 tries). A set of 5 10,000-word vocabularies from different languages would bring that probability to a near certainty. But bringing suffixes into a calculus would lower the chances to infinitesimally small number, requiring thousands of 10,000-word vocabularies to bring that probability to a certainty, an impossible task. Having ruled out a chance of random coincidence, a scenario of common origin is inevitable, and the Eastern Europe of the 3rd mill. BC becomes inescapable as a source from which, at different times and by different routs, the word could have traveled to the Mediterranean, Northwestern Europe, and South-Central Asia. The Celtic trail extends the existence of this word in the Eastern Europe, complete with its morphological modifiers, to the 5th mill. BC. Morphological evidence suggests an areal language or group of languages with largely shared morphological system consistent with the morphology of the Türkic languages down to a series of minute details. Actually, similarity is much closer than between the adjacent European languages of the same linguistic group, like French and Italian, Spanish and French, or Polish and Russian, and that is in spite of the great temporal difference between, say, Türkic and Avestan or Türkic and Old Irish. A complex of converging evidence unequivocally points to the Türkic phylum as an origin of the word. The phenomena of such retention opens a large window for further investigation. See period, time.

English jar (v.) “harsh, grating sound” ~ Türkic jar-, yar-, ya:r- (v.) “crack, swish, split, rive, cleave”. The Türkic stem jar- (v., n.) is extremely productive, with multifaceted semantics, including splitting and breaking. Semantic extension to the type of sound may be a reflection of the Türkic religious practices. The anlaut semi-consonant j-/g- is a trait of the Ogur languages, while the Oguz languages start with the vowel, ya-, hence forms jar- vs. yar-. The English semantics refers to a noise of a shaking, explosive sputtering like of a splitting log. No IE cognates. Jar-/yar- is widely used in Sl. languages: yarost (ÿðîñòü) “rage, fury, wrath”. In Lat. the homophonic and allophonic form jaru- (v.) “illumine” turned into an allophone aurora, presently an international word, see aurora. The IE speculation on echoic or imitative origin may not be totally baseless, though it is unlikely that the original Türkic jar- “crack, cleave” have originated from a splitting sound. A speculation on a relation to the homonymic Türkic words jar (n.) “vessel” and earl “title” is purely baseless. The chances that two homonyms jar- “sound” and jar “vessel” may accidentally evolve in two language families are nil, this is a salient case of paradigmatic transfer attesting to a common origin from the Türkic phylum. See aurora, earl, jar (n.).

English jeer (v.) “ridicule with contempt and derision” ~ Türkic jer- (v.) “deride, dismiss, reject, repel, slander, disparage”. In some semantic meanings, it is synonymous with the stem hool- in the word hooligan. Cognates: A.-Sax. gyr “to deride, to mock”, Gmn. scheren “to plague, vex”. The anlaut consonant points to the Ogur subfamily, the Oguz counterparts do not use a consonant in the anlaut: er. Oguric forms could have produced der-/jer-/ger-/ɣer-/her-, and the like. Not a trace of IE etymology, even a most fanciful, the best is “of uncertain origin”. See hooligan.

English jerk (v.) “abrupt movement” (Sw N/A, F1589 0.005%) ~ Türkic jul (v.) “jerk, pull (out, away), extract, extricate”. English has it with a connotation of slang, an illegitimate word, first recorded in 1570s as “sudden sharp pull”. The Türkic verbal form, like the English variety, has numerous derivatives associated with abrupt move, unsteady affect, clinching action. The semantic coincidence is perfect, a phonetical correlation with liquid -r-/-l- is credible, the final -k/-g suggests a version with Türkic suffix -k/-g/-ɣ or their modifications. The Gmn. Zuck and Ruck with various Gmc. variations point to a wide phonetic dispersion of the verbal stem, with numerous derivatives. The English derivatives “perform male masturbation” (jerk off), “masturbator, tedious and ineffectual person” (jerk, n.), adjectival “inferior, insignificant” have Slavic allophone droch- with similar derivatives. IE rated “of uncertain origin, perhaps echoic”, no IE concoctions offered. The geographic spread of the allophonic forms suggests an early scatter, pointing to the Kurgan or Sarmatian waves from the E.Europe. The initial consonant j-/z-/d- points to the Ogur-type languages of the Sarmatians, Huns, and Alans.

English jig (v.) “jerk, yank” ~ Türkic jïq-, yık- (v.) “bring down, plunge, cast down, fell, crush (by blow)”, (n.) “spindle”, with usual phonetic changes jeg, jïk, yıx. English spelling comes in two flavors, jig and gig, reflecting the Ogur anlaut consonant. In Oguz version the word starts with semi-consonant or vowel yig-, ig-. The Türkic verbal notion largely implies down, Eng. movement is not specifically down. No IE or any other etymology whatsoever. There is no need for the OFr. “proto-word” gigue “fiddle” and its Romance cognates, the truth is much closer to home. The form jiggle (v.) may be diminutive or frequentative, as claimed by the European philologists, or might as well be a Türkic regular denoun verb jiglä that smoothly diffused into English to be eventually re-interpreted.

English jog (v., n.) “walk or ride” ~ Türkic jag-/yag-, jaɣ-/yaɣ- (v.) “go, fall, run” related to movement: event goes, snow falls, time runs”, (adj.) yadaɣ “on foot”. The inlaut -d- comes fr. adaq “foot”, adaɣ, yaδaɣ “on foot”. Key noun/adj. notion means “horseless”, and thus “pedestrian, poor, weak, piteous”, and military “infantry, unmounted”. Cognates: None outside Türkic; metaphoric extensions “poor, weak, piteous”: Manchu jada-/yada- “sickness”; Mong. jadaɣu “poor, weak”, Cf. Türkic jadïɣ/yadaɣ “horseless, on foot”; Evenk ǯadaŋ, ǯaday “grow poor”. Distribution - Eurasian width with western and eastern appendages. No IE reconstructive concoctions. IE etymology is rated “of uncertain origin, perhaps used in horsemanship” that somehow alludes to the proper motif, but blurs a specifically horseless context. The crispiness of the Türkic semantics vs. metaphorical loanwords is predicated by the mounted existence of the Türkic Kurganians vs. sedentary neighborhoods. The Türkic-Mongol linguistic similarities ascend to 93 AD, when 100,000 Hunnic families numbering 500,000+ people submitted to the ruling minority of the Syanbi (pin. Xianbei 鮮卑) Mongols (Western Tunguses marked by Y-DNA haplogroup C vs. Türkic R1), essentially making Hunnic language a language of the Syanbi, and continuous amalgamation in the following centuries. See jag, yacht.
jog (v.) 1540s, "to shake up and down," perhaps altered from Middle English shoggen "to shake, jolt, move with a jerk" (late 14c.), a word of uncertain origin. Meanings "touch or push slightly," "stir up or stimulate by hint or push," and "walk or ride with a jolting pace" all are from 16c.

English journey (v., n.) “travel” ~ Türkic jor-, jorï-, yor-, yorï-, jur-, yürï-, jürü-, yuɣur- (v.) “go, walk, march”. In contrast with the verb bar- “go” (directional, one-time event), yör- “go” is omnidirectional, recurring. Its primeval status is attested by exceedingly diverse forms and semantic excursions. Forms carry transformations ascending to an oldest form yorï-, from jor- to su:r and the like. Like a species found on eight continents, forms have eight variations, each with its own overlapping semantic clusters. That allows phonetic sleuths to browse along evolutionary trails to discover mechanisms of the changes logically arranged in a neat continuous Family Tree chain eidetic to biological evolution, without any unruly stochastic effects endemic to real lexicons. These disrupting effects do however pop up in various documents with parallel forms encountered in the same texts. Additionally, the word serves widely as an auxiliary verb and components of compounds, and populate diverse Türkic languages each with its own history. Effects refuse to collapse into a neat order, forcing a switch from a shiny scientific evolution to a purely observational ledger. Cognates: No IE cognates whatsoever: OFr. journee “day's, daily” and “journey” are homophones, the first one is a cognate with Lat. diurnalis “daily” fr. a root dies “day”. The second is a cognate with Türkic jor-/jur- that does mean “travel”; Ir. turas “journey”; Mong. ǯori- (/djori-/) “head to, intend”; Tungus düre- “walk”; Yürük, Yörük - Türkic tribe, lit. “nomad, rover”; Kor. jeongdo (정도) “journey”. Distribution typical for Türkic loanwords - Eurasian width with some western and eastern appendages. No IE reconstructive concoctions. The verb came to light in 12th c., linked to OFr. journee (v., n.). If that is so, it became a European word fr. the Alanian, Burgund, or Vandal lexicon. The prosthetic j- points to Scythian, Sarmatian or Hunnic origin; in the Oguz languages it is pronounced with anlaut y-: yor, yorï. Typical for lexemes coming from different sources, journey is synonymous with voyage from LLat. viaticum “journey”, traverse from Lat. transversare “journey, pass across”, and its contracted derivative travel “journey”. The noun journal “daily ledger” is derived from the noun “journey” or from LLat. diurnalis “daily (ledger)” etymologically conflated with “journey”. The Türkic etymology is direct and straightforward, the IE etymology is artificial and tenuous. Chances that two identical three-phoneme words can exist in two independent languages are quite high, but chances that those same two words would happen to share a same semantics, in this case “journey”, are infinitesimally small. See amen, you.

English jut (v.) “protrude”, jut (n.) “precipice, brink” ~ Türkic jalt (adj.) “precipitous, brink (adj.)”. Cognates: Eng. jetty “seawall”. The IE suggestion of cognates jet (v.) “stream” or “throw” (v.) are semantically unrelated; thus no coherent IE etymology. The jut “fiber”, Jute “people”, and adjutant “helper” are also semantically unrelated accidental homophones. Elision of inlaut liquid -l- should be expected. Articulation of the initial j- is harder in the Ogur languages and lighter y- in the Oguz languages (jalt vs. yalt). Phonetics is near perfect, and semantics is precise.

English know (v.) “cognize, recognize, perceive” (Sw N/A, F15, Σ1.02%) ~ Türkic köni, göni (n.) “verity, truth”, köŋül, göŋül, köngül, göwün, kölün (n.) “consciousness, thought, mindset, memory, meditation, conviction”, q:an-, qanï-, ka:n- (v.) “cognize, understand, consider, convinced, reckon”. Phonetic proximity of the forms connected with knowledge and mind points to their once shared origin. In the semantic hierarchy of each stem, they occupy a secondary status behind a prime meaning. That shows a drift, secondary meanings associated with a prime meaning due to phonetic and semantic proximity or due to metaphoric content, e.g. “straight” and “truth”, “heart” and “truth”. The Türkic notion “know” has more than 20 synonyms, of which 10 have distinct stems, each with its own pleiad of grammatical derivatives. The notion “know” (v.) fr. a verb köni is one of at least 31+ meanings in 7+ clusters. Ultimately, it is a derivative of the verb kön-/gön- “right (something), make things right” with a rich trove of derivatives. The notion “know” is but one lesser facet there. Besides “know”, Gmc. languages propagated numerous carryover meanings connected with the Türkic base kön. The notion “know” (v.) fr. a verb q:an- is one of at least 23+ meanings in 5+ clusters. A main cluster denotes a notion “can (execute, achieve a desire, able)”, see can. A notion “know” is one of the secondary clusters. That created a bifurcated semantic split of “can” and “know”. What appear to be homophones are two offshoots in a pack of siblings from a single parental verb. For uninitiated etymologists, situation is confusing and confused. Gmc. cognates are semantically split, and viscerally are treated as homophonic lines. Initially common in Gmc., the verb fully survived only in the A.-Sax. English.Cognates: A.-Sax. can, con “learn, know” and “can, be able to”, cnawan (also with prefix a-, be-, to-) “know, perceive”, cnawlaec “knowledge”, cnawe “conscious”, etc., all opposed to “believe”, ONorse kenna “know, make known”, OFris. kanna “to recognize, admit”, OHG chnaan (also with bi-, ir-) “know”, Gmn. kennen, erkennen, können “know”, Goth. kannjan “to make known”, kann know”; Ir. gnath “known, famous”; Balto-Sl. (OCS) znati, (Lith.) žinoti, žinau, (Latv.) zinu, zinat; Fr. connaitre “perceive, understand, recognize”; Lat. gnoscere “know, recognize”, agnoscis “recognize”, cognoscere “get to know, recognize”; Gk. gnos (γνωσ), gnosko (γνωσκω) “knowledge”, gnosiz (γνωις) “cognition”; Alb. njoh; Skt. janati; Kuchean knan, all “know”; Mong. qan-, qanu- “meet desired”. A native A.-Sax. term for “know” was wit, witt (n.) “understanding, intellect, sense, consciousness”, witan (v.) “know, understand, aware, conscious, observe, perceive”. The A.-Sax. wit is not too far from the Türkic equivalents: bil-, pil- “learn, know”. A transition b-/p- > w- is regular and should be expected, a transition -l- > -t- is less obvious. If the root is bi-, -l- is an impersonal verbal voice marker, -t- is causative or passive verbal voice marker, bit ~ wit. In a range of languages, more cognates of the root can- are derivative, Cf. Türkic qa:n, Manchu qantara “impatiently wait (~ desire)”, Türkic qantar “caprice, whim (~ desire)”. The line-up of the cognates is remarkable, the Gmn. können and the Türkic könen-, a verbal derivative of the verbal root kön-/gön- denoting “enjoy, rejoice, achieve prosperity” and a meaning “know”. The two are exactly identical phonetically and rigidly linked semantically. The Gmn. können is universally recognized as a cognate of all the cognates listed above. That is indelible attestation of a common origin. Versions of the form and semantics of the related cant “tilt, twist” (notion “straight” vs. “non-straight”), rated as “a word of uncertain origin”, are found in many languages: Teutonic, Slavonic, Romanic, Celtic, Latin cantus (OED). Distribution of the can “know”, common to the Germanic and Türkic languages, proves its etymological link. In the east, the later Türkic forms köni, q:an- “know” did not gain popularity. That corroborates a thesis that the European-Türkic amalgamation took place within the confines of the Corded Ware period and space. In the west, A.-Sax. carried the conventional bifurcated semantics eidetic with the Türkic original. Another trait is a common trend starting with the Goth. a Gmc. shift to the semantic prong “know” while the notion “can, able” was filled by older words haban “have” and magan “might”. The near-perfect phonetic and perfect semantic congruence attests to the common origin of the word can “know” on top of the A.-Sax. wit's archaic link to the bil-/pil-. Unwittingly, the suggested faux “PIE root” *gno- “to know” is an allophone of the real and attested Türkic root kön-/gön-/q:an- “to know”. The genetic connection debunks an IE etymological idea that the diphthong consonant cluster kn- was a result of peculiarly European process of regular phonetic shifts. In reality, it was a predictable vowel elision in a process of internalization with a particular propensity to consonantal clusters that converted syllable kön-/gön-/q:an- into a consonant diphthong cluster kn-, with succeeding elision of k-. Like the can “know” and the can “able”, the magan “might” and cant “tilt”, and the nearly synonymous pair deem and know present a solid authentic case of paradigmatic transfer from the Türkic linguistic phylum to the Gmc. and other neighboring languages, an indelible testimony of a common genetic origin. See can, deem, do, I, might, this.

English keep (v., n.) “hold” (Sw N/A, F184, Σ0.11%) ~ Türkic kap- (qab-/qap-) (v.) “keep, seize, grab, hold”, ultimately from from kap (qab/qap) (n.) “container, vessel, box” (lit. interpretation) or a shape of the container contents taking a shape of the container (metaph. interpretation), like frozen water in a container, see cup. The verb verb has 7+ semantic clusters containing at least 30+ meanings, a prime meaning “grab”. Related to Eng. capture, cab, cap, cup, cabbage, cable, have, and far beyond, reflecting 39 derivatives listed in a small Turkish dictionary plus independent European innovations. Cognates: A.-Sax. cepan (kepan) “keep, guard” (also with prefix a-, be-, etc.), ONorse hafa, OSax. hebbjan, OFris. habba, Gmn haben, Goth. haban “keep, have, possess, hold”; It. capisci, Sp. capiche, Lat. capere “grasp, seize”; Fin. kaapata “capture”, Hu. kap “receive, get”; Mong. havshix “bite (fish)”, Kalm. xawl- “catch (ball), bite (fish)”. Distribution is typical for Türkic loanwords: Eurasian steppe width with some far western and far eastern appendages, crossing linguistic barriers. Clauson gives kıp as an allophone of kap (EDT: 580). Thus the phonetic and semantic match is perfect. No sane IE or any other etymology whatsoever. The IE etymology asserts “no certain connection to other languages”, i.e. other than Gmc. a funny claim. The Gmc. “IE” etymology is thoroughly confused, unwittingly mixing up two independent Türkic words, one kap- for “capture, keep”, and the other kö:z/gö:z- for “gaze, look”. It suggests a contorted phantom “PG” *kopijan, a faux “PIE proto-word” *kap- mechanically derived from A.-Sax. cepan with two independent meanings and Türkic verbal instrumental suffix -an, and a faux “PIE proto-word” *gab- “to look after”. It further suggest a faux Romance “proto-word” *kapio and faux “PIE proto-word” *kh2pyéti fr. a faux root *keh2p- “grab, seize”, a very special pleading for a IE Romance case. Incredibly, while expert gurus discovered a fictitious proto-word “gaze”, they have completely missed the ubiquitous Gmc. “have” listed as a cognate of “keep, hold”. The omnipresent importance of the “have” in Gmc. languages is supreme, it fills 1.25% of the English daily lexicon, or every 80s word in a random English text. Too hard to miss. See cap, cup, capture, have.

English kick (v., n.) “strike” (Sw N/A, F1206 0.01%) ~ Türkic kik- (v.) “parry, punch”, strike two objects, like honing knifes by whetting one knife against another (Kashgari II 293); waves parry; arguing people parry. Most of secondary meanings are connected with anger and conflict, Cf. Mong. (Kalm.) cognate kekixǝ “angered, enraged, fighty”. No IE cognates, non-IE word, the origin is a standard “of uncertain origin”. A Türkic noun kik “wild animal” points to a notion “attacking, rushing (animal)”. The Türkic twin physical and figurative semantics presents a case of a paradigmatic transfer with perfect phonetics and peculiar semantics: eidetic kick “parry, fence” and kick “quibble, argue, retort, recoil” (Cf. Tr. kikinč “retort”, kikšür- “argue” and a derivative kikne:- “flurry, derange”). The word was used in Türkic Buddhist religious lingo centuries before and after the turn of the eras, before the advent of the Manichaeism and Nestorian Christianity. In Khakass (Enisei Kirgiz) lexicon it related to whetting, that was a primary, and a metaphorical religious term was a secondary semantics. The dignified Buddhist word used in the translation of the Bible in Eng. became standing parliamentary idioms: “kick the can down the road”, “kickbacks”, and “get a kick out of”. Reference to whetting by banging knifes points to the Iron Age. The atypical scarcity of Türkic derivatives points to a relatively recent innovation. The same is also attested by the limited geographic distribution and the atypical absence of a cluster of dialectal allophones. The case of paradigmatic transfer provides a positive attestation of a genetic origin from the Türkic milieu.

English kill (v.) “cause to die” (Sw N/A, F320, 0.08%) ~ Türkic kıl-/qıl- “do, make, act”, Chuv. vil-/wil- “kill”. The end links of Sum. gil “kill” and Eng. “kill” are joined by the links of Türkic qıl- “do” and öl “kill”. The Eng. mysterious do in “kill” and the phrases like “to be done” alluding to irreversible deed precisely mirror the imprecise Türkic semantics of the verb qıl- “do”. From auxiliary semantics “do (something)” with a myriad applications, the Eng. verb retained one, probably most drastic and frequently used, a semantics of a concrete verb “kill”, “make dead”. Eng. retained both of the two Türkic synonymous service words, kıl-/qıl- “kill” and tu-/tur-/dur- “do” that refer to, but not specify, a concrete action or result, see do. This is a case of paradigmatic transfer unambiguously attesting to a genetic connection. The hypothesis appear viable, none of the IE auxiliary action verbs meet the the perfect tense semantics embedded in this auxiliary verb. Confinement of the verbs do and kill to the Gmc. group attests to their guest status within the IE family, and there are no prohibitive indicators against this hypothesis. A somewhat more phonetically tenuous candidate is the Türkic öl- “to die”, ölüm “death”. The A.-Sax. root cwell of the form cwellan formed the modern form kill (ca 1200). A transition from the form öl- to cwell requires a two-step prosthetisation, first with an attested prosthetic w- (wöl-, Cf. Chuv. form), and then with a prosthetic k- (c-). The form wöl- “death” is attested with ligatures e-/wae-/wea-/we-. The word wael is a phonetic equivalent of the form öl “to die”, it is attested in the Chuv. vil-/wil-. The word waelm (ölm) is a phonetic equivalent of the form ölüm “death”. In both cases, the part -üm/-m is a Türkic/Anglo-Saxon deverbal noun suffix. These forms do not carry a perfect tense connotation, that connotation is formed with a prefix k- (c-) in the form cwell “kill”. Difficulties in transition to the Roman alphabet are illustrated by the inconsistency of spelling, Cf. el, wael, wel, weal, wiel. The Clark-Hall, 2011, Anglo-Saxon Dictionary, lists a rich trove of 104 words derived from the word wael “death”, including those with prefixes a-, be-, c- (cwelan “to die”), etc. The compound c- + wael forms the words cwalu (n.) “klling, murder, violent death, destruction”, cwellan (v.) “to kill, murder, execute”, cwellend “killer, slayer”, and a trail of derivatives with allophones. The form with the prefix c- (k-) turned out to be most productive, it corresponds to the IE prefix s- that forms perfect tense (s-/k- alternation, aka satem-kentum split), and thus is most archaic. Tentatively the form öl- might be a contraction of a form with an initial consonant or semi- consonant. Other cognates: A.-Sax. wael “slaughter, carnage”, Waelheall “death-hall” (see hall), with ligature wae- in Wael rendering -ö-, OSax. cuellian “to kill, torture”, Goth. waltjan “beat upon”, Goth. expression Walhalla (Valhalla) “death-hall” (Hall of dead), an allophone of the Türkic ölqalïq “death-hall” and A.-Sax. allophone Waelheall “death-hall”, ONorse kvelja “to torment”, MDu quelen “to vex, tease, torment”, OHG kuellan “to suffer pain”, Gmn. quälen “to torment, torture”; Arm. kelem “I torture” with Türkic 1st pers. sing. suffix -em; Hu. gyil, öl “kill” have preserved both Oguric and Oguzic forms, the Ogur form gyil matches the Sum. gil and Eng. kill; Sum. gil “kill”, gilal “butcher”, of the base öl- with Ogur prosthetic initial g-/k-. The IE etymology dies not consider these clones of the Türkic öl-. Instead, it starts with the Anglo-Saxon/OSax. derivative form cwellan that, naturally, leads to nowhere. The Türkic kil- and Sum. gil “kill” appear to be the closest to the Eng. kill, the other Cmc. and Hu. forms appear to be local dialectal adaptations. With the abundance of the attested forms, there is no need to simulate innocence, no place nor a need for a Proto-Gmc. quasi-reconstruction *kwaljanan with piled prefixes and suffixes obscuring the root stem. There is no need for the mental equilibristics with the newly invented PIE root *gwele-. In addition to the notion “death”, Türkic has more than one way to express the notion “kill”, some ascending to the first documented Sumerian records. The Türkic notion “kill” is also conveyed with the stems for the notions “beat” and “gore”. A.-Sax. and Eng. also have more than one way to express the notion “kill”, a number of them must ascend to the yet unexplored “Old Europe” Sprachbunds of the pre-IE family. Herodotus IV. 110, attested the Scythian pata- (v.) “to strike, to kill” in the compound eorpata - those who are striking their husbands (Türk. er “man, husband”), the Sum. form badd is the oldest record for Eng. bat (v. and n.) ascending to the 4th mill. BC, see bat, pat. “Gore” is a truly Eurasian word, extending from Scotland to Japan, see gore. An allophone (or a homophone) of the form “kill” is present in the Türkic qïlïč (kılıč, kilich) “sword”, a killing tool, a loanword into Pers. and other languages. Given the Sum. gil “kill”, the precise semantics makes a random coincidence unlikely, whilst its origin is held from a Türkic stem qïl- “make, do”. That echoes the Eng. inexplicable idiom “do (him) in” for the verb “kill”, using the qïl- “do” for “do” would make the idiom to ring “qïl (him) in”, a striking incidence. The Sum. form ascends to the 4th mill. BC, Sum. is unrelated to the IE family, that brings the origin of the term kill via allophones gil and öl- way beyond the width and depth of the extant IE etymology. See bat, battle, do, gore, hall, mist, pat.

English laugh (v., n.) “guffaw” (Sw N/A, F1337 0.01%) ~ Türkic gül-, kül-, wül- (v., n.) “laugh, laugh at, smile”. The Eng. laugh is a phonetic transformation of the allophonic hlæ- of gül-/kül-to A.-Sax. hlæhhan, hliehhan, hlihhan, -hlyhh-, hlag-, hlað-, lað-, leahtor- (hleahtor-) “laugh” (spelled with infinite suffix -an) with elided anlaut h- (Tr. g-/gü-), softened inlaut -hh-/-g-, also transcribed -h-, and shed conjugational suffixes. The vowel transcription -æ-, -ie-, -y- corresponds but not necessarily reflects the CT rounded -ü- or the -e- of the Caspian basin languages. Both the h- and l- forms are recorded for the A.-Sax., attesting to the elision possibly commencing still on the continent, either during or before the amalgamation of the Anglian and Saxon, within the Saxon/Sekler/Scythian vernaculars, or at the conflation of the future Scythian, Sarmatian, and the Corded Ware vernaculars. Cognates: A.-Sax. hlæhh, leah, OSax. hlahhian, OFrisian hlakkia, ONorse hlæja, OHG hlahhan, Goth. hlahjan, Dan. le, MDu., Du., Gmn. lachen, Lith. klageti “cackle”. Erroneously surmised as cognates the Eng. cackle, OHG kachazzen, Lat. cachinnare, Skt. kakhati," OCS chochotati (õîõîòàòè), Gk. kakhazein, Arm. karkach (կարկաչ) are allophones of the Türkic kakı:la:-, kakı: “cackle, cachinnate”, which is stronger than laugh, see cackle. Distribution is consistent with other English Turkisms, across Eurasian Steppe Belt plus NW European fringes. No IE cognates, positively no connection to IE family. The “PG proto-word reconstruction” *hlahjaną is a mechanically assembled nonsensical faux “proto-word” divorced from etymology, history, and demography. Of the five Türkic terms related to laugh, four had transferred paradigmatically to Eng.: gül- as hlæ-/laugh and hilarious; kakı:la:- as cackle, cachinnation, gag; jer- as jeer; elük as joke. These four Türkic words complemented and supplanted the corresponding native European lexicon that came as a part of the A.-Sax. language but faded with time. The paradigmatic transfer of the Türkic quartet testifies to the common genetic origin from a Türkic phylum. See cackle, hilarious, jeer, joke.

English lie (v.) “deceit”, liar (n.) “deceiver” (Sw N/A, F596 0.04%) ~ Türkic yalan, yalgan (yalɣan, yalğan) (adj., n.) “deceitful, false, deception, lie/lies”. Ultimately fr. a verb yala:- “suspicion, incriminate” fr. a stem yal “ignite, heat up, flair up”, a derivative of a verb ya:- “ignite shine, flame” which produced the IE ignate (see ignate ) and its passive form yal- “blazed, burned, shined”. Gmc. including English have retained an elided stem lag- with a deverbal adj. suffix -ɣ-/-g- “lying”. Cognates: A.-Sax. leogere “liar, false witness”, OSax., OHG liogan, OFris. liaga, ONorse ljuga, Dan. lyve, Goth. liugan, Gmn. lugi, liogan, lügen; Ir. log-, logaissi “mendacious”, fol-lugaim “I hide, conceal”; Sl. yala (ÿëà) “punish”, lgat (ëãàòü) “lie”, Balt. (Lith.) luginaite “traitorous”; Mong. žali “deceit”, zalga:, zalgas “false (braid)”. The word yalɣan was borrowed into Mari, Lezgin, and Balkan languages (Sevortyan 1989 vol. 4 p. 91). Distribution spans Eurasia from Atlantic to Far East, crossing linguistic borders adjacent to the Eurasian Steppe Belt. Gmc. forms are fairly uniform, attesting to a stable Sprachbund associated with the society of the Corded Ware period. Türkic nominal liar is an adjectival paired compound yalɣan ar (“lying man”), hence the English synonymous contracted form liar. No IE parallels. IE etymological speculations do not strain too much, rating the word with routine “a word of uncertain etymology”. Neither Eng. nor Sl. etymologies are deeper than Balt. and Goth. Instead, a long list of etymologists offer some diverse nonsense. In contrast, Turkological studies made good inroads pursuing Altaistic models. In the archaic Türkic culture, lying was among the greatest human vices, so the word had grave connotations; vestiges of that attitude still survive in the British culture, to a much lesser degree in the American culture, and fairly well in the modern Türkic cultures. That trait stands in sharp contrast with the Near Eastern – Mediterranean cultural heritage.
“” “”  “” “” “” “”  = ≠ ~ ≈ ≡ ± ° – → Ø Sevortyan E.V., 1974, (EDTL) Sevortyan, 1974, 39

English lull (v., n.) “hush to sleep, hum a lullaby” ~ Türkic ulï- (v.) with some variations (-a-/-u-) “wail, moan, bellow”. The lull is connected with lullaby and a cognate of ululate, which is a derivative form of ulï-. Cognates: OE, MDu., Gmn. lullen, lollen “hush to sleep” (same word, cited with with a suffix), Sw. l