25:
234:
301:
file, but that it has no information beyond "the text itself"—no representation of bold or italics, paragraph, page, chapter, or footnote boundaries, etc. Michael S. Hart, for example, argued that this "is the only text mode that is easy on both the eyes and the computer". Hart made the correct point
327:
In consequence of this, such texts cannot be reliably re-formatted. A program cannot reliably tell where footnotes, headers or footers are, or perhaps even paragraphs, so it cannot re-arrange the text, for example to fit a narrower screen, or read it aloud for the visually impaired. Programs might
323:
might be omitted, or might simply appear as additional lines of text, perhaps with blank lines before and after (or not). An ornate separator line might be represented instead by a line of asterisks (or not). Chapter and sections titles, likewise, are just additional lines of text: they might be
302:
that proprietary word-processor formats made texts grossly inaccessible; but that is irrelevant to standard, open data formats. The narrow sense of "e-text" is now uncommon, because the notion of "just vanilla ASCII" (attractive at first glance), has turned out to have serious difficulties:
363:
output usually produces more information than this, such as the use of bold and italic. If this information is not kept, it is expensive and time-consuming to reconstruct it; more sophisticated information such as what edition you have, may not be recoverable at all.
371:, spaces, tabs, and the like: Spaces between words; two returns and 5 spaces for paragraph. The main difference from more formal markup is that "plain texts" use implicit, usually undocumented conventions, which are therefore inconsistent and difficult to recognize.
374:
The narrow sense of e-text as "plain vanilla ASCII" has fallen out of favor. Nevertheless, many such texts are freely available on the Web, perhaps as much because they are easily produced as because of any purported portability advantage. For many years
344:
relating to the text is sometimes included with an e-text, but there is by this definition no way to say whether or where it is preset. At best, the text of the title page might be included (or not), perhaps with centering imitated by indentation.
339:
the work. For example, is it the first or the tenth edition? Who prepared it, and what rights do they reserve or grant to others? Is this the raw version straight off a scanner, or has it been proofread and corrected?
175:
information, or not. An e-text may be an electronic edition of a work originally composed or published in other media, or may be created in electronic form originally. The term is usually synonymous with
324:
detectable by capitalization if they were all caps in the original (or not). Even to discover what conventions (if any) were used, makes each book a new research or reverse-engineering project.
356:, or even the simplest tables. This leads to endless practical problems: for example, if the computer cannot reliably distinguish footnotes, it cannot find a phrase that a footnote interrupts.
309:
or the accented vowels used in many
European languages cannot be represented (unless awkwardly and ambiguously as "~n" "a'"). Asian, Slavic, Greek, and other writing systems are impossible.
453:
315:
Third, "e-texts" in this narrow sense have no reliable way to distinguish "the text" from other things that occur in a work. For example, page numbers,
312:
Second, diagrams and pictures cannot be accommodated, and many books have at least some such material; often it is essential to the book.
441:
Reading and
Writing the Electronic Book. Nicole Yankelovich, Norman Meyrowitz, and Andries van Dam. IEEE Computer 18(10), October 1985.
422:
220:, and other information in their texts, as well as in some cases (such as FRESS) supporting not just text but also graphics.
143:, and especially a document that is mainly text. For example, a computer-based book of art with minimal text, or a set of
480:
290:
In some communities, "e-text" is used much more narrowly, to refer to electronic documents that are, so to speak, "plain
277:
108:
89:
259:
379:
strongly favored this model of text, but with time, has begun to develop and distribute more capable forms such as
61:
335:
Fourth, and a perhaps surprisingly important issue, a "plain-text" e-text affords no way to represent information
537:
244:
46:
68:
360:
348:
Fifth, texts with more complicated information cannot really be handled at all. A bilingual edition, or a
75:
471:
192:, have been around since long before the Internet, the Web, and specialized E-book reading hardware.
417:
57:
255:
42:
35:
532:
205:
164:
8:
291:
189:
136:
305:
First, this narrow type of "e-text" is limited to the
English letters. Not even Spanish
542:
494:
412:
376:
368:
498:
82:
484:
402:
353:
349:
407:
213:
168:
251:
172:
526:
212:
appeared in the 1960s. These early systems made extensive use of formatting,
148:
193:
140:
442:
316:
160:
152:
126:
515:
489:
466:
465:
Coombs, James H.; Renear, Allen H.; DeRose, Steven J. (November 1987).
329:
298:
217:
156:
144:
392:
201:
262:. Statements consisting only of original research should be removed.
24:
367:
If actuality, even "plain text" uses some kind of "markup"—usually
341:
320:
197:
397:
177:
151:, would not usually be called an "e-text". An e-text may be a
294:
209:
467:"Markup systems and the future of scholarly text processing"
380:
306:
200:
in the 1940s, while large-scale electronic text editing,
332:
to guess at the structure, but this can easily fail.
297:". By this is meant not only that the document is a
49:. Unsourced material may be challenged and removed.
464:
524:
352:with footnotes, commentary, critical apparatus,
223:
517:Scholarly Electronic Publishing Bibliography
196:began developing an electronic edition of
488:
423:L'Association des Bibliophiles Universels
278:Learn how and when to remove this message
109:Learn how and when to remove this message
16:Any document that is read in digital form
204:, and online reading platforms such as
525:
443:http://dl.acm.org/citation.cfm?id=4407
227:
47:adding citations to reliable sources
18:
13:
14:
554:
509:
183:
232:
216:, automatic tables of contents,
23:
34:needs additional citations for
458:
447:
435:
1:
428:
135:) is a general term for any
7:
386:
258:the claims made and adding
10:
559:
472:Communications of the ACM
418:Distributed Proofreaders
131:"; sometimes written as
159:file, viewed with any
538:Electronic publishing
167:. An e-text may have
190:electronic documents
165:proprietary software
43:improve this article
490:10.1145/32206.32209
369:control characters
243:possibly contains
413:Online Books Page
377:Project Gutenberg
359:Even raw scanner
288:
287:
280:
245:original research
224:"Just plain text"
119:
118:
111:
93:
550:
503:
502:
492:
462:
456:
451:
445:
439:
403:Electronic paper
354:cross-references
350:critical edition
283:
276:
272:
269:
263:
260:inline citations
236:
235:
228:
139:that is read in
114:
107:
103:
100:
94:
92:
51:
27:
19:
558:
557:
553:
552:
551:
549:
548:
547:
523:
522:
512:
507:
506:
463:
459:
454:Michael S. Hart
452:
448:
440:
436:
431:
408:Digital library
389:
284:
273:
267:
264:
249:
237:
233:
226:
186:
115:
104:
98:
95:
52:
50:
40:
28:
17:
12:
11:
5:
556:
546:
545:
540:
535:
521:
520:
511:
510:External links
508:
505:
504:
457:
446:
433:
432:
430:
427:
426:
425:
420:
415:
410:
405:
400:
395:
388:
385:
286:
285:
240:
238:
231:
225:
222:
185:
184:E-text origins
182:
149:scans of pages
117:
116:
31:
29:
22:
15:
9:
6:
4:
3:
2:
555:
544:
541:
539:
536:
534:
533:Books by type
531:
530:
528:
519:
518:
514:
513:
500:
496:
491:
486:
482:
478:
474:
473:
468:
461:
455:
450:
444:
438:
434:
424:
421:
419:
416:
414:
411:
409:
406:
404:
401:
399:
396:
394:
391:
390:
384:
382:
378:
372:
370:
365:
362:
357:
355:
351:
346:
343:
338:
333:
331:
325:
322:
318:
313:
310:
308:
303:
300:
296:
293:
282:
279:
271:
261:
257:
253:
247:
246:
241:This section
239:
230:
229:
221:
219:
215:
211:
207:
203:
199:
195:
191:
181:
179:
174:
170:
166:
162:
158:
154:
150:
146:
142:
138:
134:
130:
128:
123:
113:
110:
102:
91:
88:
84:
81:
77:
74:
70:
67:
63:
60: –
59:
55:
54:Find sources:
48:
44:
38:
37:
32:This article
30:
26:
21:
20:
516:
476:
470:
460:
449:
437:
373:
366:
358:
347:
336:
334:
326:
317:page headers
314:
311:
304:
289:
274:
265:
242:
194:Roberto Busa
188:E-texts, or
187:
141:digital form
132:
125:
121:
120:
105:
99:January 2013
96:
86:
79:
72:
65:
53:
41:Please help
36:verification
33:
483:: 933–947.
161:open source
145:photographs
527:Categories
429:References
330:heuristics
299:plain text
268:April 2015
252:improve it
218:hyperlinks
173:formatting
157:plain text
127:electronic
69:newspapers
543:Documents
393:Text file
321:footnotes
256:verifying
202:hypertext
171:or other
499:59941802
387:See also
342:Metadata
137:document
58:"E-text"
292:vanilla
250:Please
206:Augment
198:Aquinas
124:(from "
83:scholar
497:
479:(11).
398:e-book
328:apply
319:, and
214:markup
178:e-book
169:markup
153:binary
122:e-text
85:
78:
71:
64:
56:
495:S2CID
337:about
295:ASCII
210:FRESS
155:or a
133:etext
90:JSTOR
76:books
381:HTML
208:and
129:text
62:news
485:doi
481:ACM
361:OCR
254:by
163:or
147:or
45:by
529::
493:.
477:30
475:.
469:.
383:.
180:.
501:.
487::
307:ñ
281:)
275:(
270:)
266:(
248:.
112:)
106:(
101:)
97:(
87:·
80:·
73:·
66:·
39:.
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.