Creation of electronic documents. Software for creating documents on a computer. word processors are used to create and process electronic documents.

An electronic document can be used in all areas of activity where software and hardware necessary for creating, processing, storing, transmitting and receiving information are used. With the help of electronic documents, transactions can be made (contracts), payments can be made, correspondence and transfer of documents and other information can be carried out.

Electronic documents can contain information of a certain type, or mixed.

Figure 1.2 Types of electronic documents

It should be noted that there are more types of electronic documents, which indicates its undoubted advantage. However, the most common are text and graphic documents.

A text document is a sequence of characters (mostly printed characters belonging to a particular set of characters). These characters are usually grouped into strings. In modern systems, strings are separated by line separators, but in the past, strings were stored in the form of records of constant or variable length (for example, on punched cards). A text document can contain both formatted and unformatted text.

A graphic document is a file containing an information model of an object image, intended for use in a particular area of ​​computer graphics or specifically with a particular software tool. Each type of graphic document has a corresponding graphic file format. A graphics file format is a standardized data structure into which a graphic document is converted when written to a medium for subsequent processing and storage.

In documents of mixed type, two or more types of data presentation are combined. Their advantage is the ability to visualize document information for better understanding and assimilation.

Electronic document management

Document flow is the movement of documents in an organization from the moment of their creation or receipt until completion of execution or dispatch (GOST R 51141-98); complex of work with documents: reception, registration, distribution, control of execution, formation of files, storage and reuse of documentation, reference work.

Electronic document management (EDF) is a unified mechanism for working with documents submitted in in electronic format, with the implementation of the concept of “paperless office work”.

There are several types of electronic document processing:

General types of processing:

· Creation

· Editing

· Preservation

· Renaming

· Navigation along the route

· Placement in a folder

· Indexing

· Removal

Specialized types of processing:

· Expertise

· Approval, approval, signing

· Registration

· Classification

· Execution

· Control

· And others determined by specifics.

Processing of a document in electronic form is carried out as part of the documentation process within the limits of the access rights that are granted to employees who have roles in the document route.

In every industry or different types Activities in the processing of documents, both in paper and automated form, have their own specifics associated with differences in both the composition of the documents being processed and the processing procedures. In this regard, to automate specific documentation processes, they use different systems. There are also several types of document flow.

Figure 1.3 Types of document flow

Automated document flow is designed to solve a wide range of problems.

Figure 1.4 Automated document flow tasks

In addition to the above, modern EDMS also implements additional tasks that were previously performed by separate programs:

· Prompt exchange of information and documents - instead of Email and Internet pagers

· View the company's news feed, familiarize yourself with published documents - instead of Internet portals

· Work with an organizer and a list of personal assignments - instead of diaries

Based on the tasks facing the automation of documentation processes, a set of required functions is determined that the document management system must implement:

· Creation of document attribute cards

· Attaching files of arbitrary types to document cards

· Formation of document text from a predefined template with substitution of variable values ​​that can be in the document card or database

· Creation of document versions

· Formation of an image of an electronic document on a form using templates

·Saving a document image to a pdf or word file

·Manage user access rights

· Creating document routes

· Management of document movement along routes

· Maintaining document logs

· Maintaining directories and classifiers

· Registration and classification of documents registered in the EDMS

· Formation of orders

· Search for document cards

· Signing documents with an electronic digital signature

· Generating reports and statistics on the movement of documents

System-wide functions of automated document flow include:

· Opportunity remote work with documents via the Internet

· Using a DBMS information warehouse to store documents and metadata

· Possibility of simultaneous work with EDMS

· Ensuring information security

· Personal authentication of EDMS users

o Traffic encryption

o Role-based access control model

Today it is no longer surprising when a reader (user), coming to a library and spending a certain amount of time in it, not only does not use its collection, but does not even make an attempt to order literature. Users satisfy many of their needs at the information level, i.e. are quite satisfied with bibliographic, abstract, reference and other electronic information received via the Internet, using CD-ROM or other means. Moreover, the development of electronic resources leads to the fact that many libraries, not having the funds to acquire foreign publications, find the opportunity to organize access for their users to full-text electronic resources, and these trends are gaining momentum (electronic science Library The Russian Foundation for Basic Research, electronic publications of the Open Society Institute - Russia, etc. have provided free access to many libraries to several thousand titles of foreign journals). In addition, electronic delivery of documents is becoming increasingly widespread, which allows library users, without coming to the library itself, to order an electronic copy of an article, brochure, or fragment of a book from their workplace (or home) and receive it in a matter of moments. At the same time, it should be noted that already in many libraries the number of remote accesses (visits) to the library’s Internet site exceeds the number of regular physical visits.

All of the above confirms that old ideas about the library are crumbling before our eyes. The reader no longer always comes to the library for a document, he comes for information and/or an electronic resource. The library is gradually turning into an information center electronic resources, nevertheless retaining the functions of a regular library to serve users with printed documents. This new function of the modern library not only clearly illustrates one of the main trends in modern librarianship, it should also be taken into account both in planning the development of automated library and information technologies, and in general in revising the already outdated concept of the traditional library.

Technologies for creating an electronic document

Firstly, this technologies for analytical and synthetic information processing in ALIS.

Analytical and synthetic processing (ASP) are separate processes that include operations or groups of operations associated with the analysis of documents or data but do not lead to a significant change in their content, structure and appearance. In automated systems, ASO can be performed both in the form of manual, “human-machine”, and purely “machine” (i.e. using only automated means) operations. Examples of fully or partially automated implementation of ASO include operations of sorting, recording/rewriting, encoding, decoding, encryption/decryption, conversion, etc. Data processing is a generalized name for heterogeneous processes related to data. You can note some terms associated with types of data processing:

  • · Integrated data processing - the principle of organizing data processing in automated system, in which processes or operations previously performed in different organizations, divisions or sections of the technological chain are combined or optimized to improve the efficiency of the system. One possible goal of "integrated data processing" is the creation of integrated databases.
  • · Distributed data processing - data processing carried out in a distributed system, in which each of the technological or functional units systems can independently process local data and make appropriate decisions. When executing individual processes, nodes of a distributed system can exchange information through communication channels in order to process data or obtain analysis results that are mutual to them.
  • · Automated processing (data/documents) - processing (data or documents) performed automatically, without human intervention or with limited human involvement. The technical means for implementing “automated processing” can be computers or other devices or machines.
  • · Machine processing - performing operations on data using a computer or other data processing devices.
  • · Pre-machine processing, preparation of data for input - a stage of analytical-synthetic processing or document processing associated with the formalization of the final documents and recording their contents on a worksheet.
  • · Sorting - automatic or manual distribution of documents or data according to any specified criteria. A set of processes associated with bringing records in a file into compliance with latest changes in the subject area or new information (data) obtained. "Updating a file" involves performing the following operations: viewing records, adding new records, erasing (deleting) or correcting (editing) existing records.

Secondly, this information digitization technologies.

Information digitization technologies include:

1. Scanning.

This method is especially convenient for those who prepare multimedia applications based on existing printed materials and illustrations.

A scanner is a peripheral device of a personal computer that allows you to enter originals presented in the form of text documents, drawings, slides, photographs and so on. The scanner converts such information into electronic information.

Based on a general assessment of the characteristics of scanners, these devices can be divided into the following subgroups:

  • o Handheld scanners.
  • o Page (broached) scanners.
  • o Slide scanners.
  • o Projection scanners.
  • o Drum scanners.
  • 2. Enter data from the keyboard.

Keyboard data entry is a type of manual input. A technological means of providing keyboard input is a worksheet. Text input is done using text editor Word, which is included in the software package under common name Office, published by Microsoft Corporation. It is designed to run under the Windows operating system.

Word is a Windows application designed for creating, viewing, modifying and printing text documents, which makes it possible to perform all traditional operations on text, without exception, provided by modern computer technologies:

  • o typing and modification of unformatted alphanumeric information;
  • o formatting characters using a variety of True Type fonts of various styles and sizes;
  • o page formatting (including headers and footers);
  • o formatting the document as a whole (automatic creation of a table of contents and various indexes);
  • o spell checking, selection of synonyms and automatic word hyphenation.

Thirdly, this technologies for processing digitized documents.

Technologies for processing digitized documents include character recognition and their further editing.

Optical character recognition software - OCR (Optical Character Recognition) is designed for automatically entering printed documents into a computer. Most often, users use in practice the FineReader optical text recognition system produced by ABBYY and the CuneiForm optical character recognition system produced by Cognitive Technologies Ltd. Both systems have approximately equal capabilities and are designed to recognize texts written in almost any font (except hieroglyphs and Arabic), without prior training. A special feature of the programs is high accuracy of character recognition and low sensitivity to printing defects.

Software systems OCR works as follows. Let's say you have a text paper document with complex structure, i.e. In addition to text, the document contains tables, diagrams, illustrations, etc. You need to edit the text using a word processor. To solve this problem, you need to place a document in a scanner and create an electronic copy of it, which is a graphic image of the document.

Next you need to convert the image to text. This stage is very responsible, since the result of scanning is an image that can be saved exclusively in a file of one or another graphic format or loaded for processing not in a text editor, but in some kind of graphic editor. You can paste an image directly into a text editor. However, for text applications, the image is an indivisible element and cannot be identified. Thus, even if there is text on the image, it cannot be edited using a text editor. Therefore, first of all, it is necessary to convert the image of characters into text as such, i.e. into a sequence of characters that can be processed in text editors.

The FineReader and CuneiForm software systems allow you to solve problems of converting images into texts and are supplied in the form of complexes that are powerful enough for use in practical activities.

The process of text recognition by OCR systems is as follows.

First, you need to get a graphic image of the document, which can be done in two ways - scanning the document or loading an image from a file. The application software environment allows you to view images at various scales, as well as perform some transformations with them, in particular, rotate and invert them.

At the next stage of document processing, they are marked. The purpose of this operation is to tell the OCR system how text is positioned in the image.

The results of automatic marking can be adjusted manually - create and delete blocks of text, move them, set their sizes, divide them into adjacent blocks, make blocks polygonal by attaching block rectangles to them or cutting them out, etc. Blocks can be assigned serial numbers, which is very convenient if you need to convert complexly formatted text into simple text.

For texts printed blurry or on a non-uniform background, adaptive scanning is used, which allows, at the expense of some reduction in productivity, to increase the accuracy of determining the outlines of letters.

When recognizing low-contrast, faintly printed documents, the recognition quality can be improved by adjusting parameters such as brightness, contrast, and black and white point thresholds.

Markup and OCR settings are configured to ensure that the OCR system is able to correctly block and recognize text. In particular, you need to indicate in which language(s) the text to be recognized is written.

For operational work With recognized text, the OCR system integrates its own text editor, reminiscent of the WordPad application built into Windows. The text editor is capable of supporting such basic text formatting features as fonts and styles, superscripts and subscripts, tables, columns, and frames that can be moved on top of the text. In the recognized text, dubious words are highlighted with a certain background, and the text editor provides tools for quickly searching for dubious words, which greatly simplifies viewing and editing the recognized text.

Some versions of OCR systems, for example ABBYY FineReader Corporate Edition, contain integrated tools for teamwork. To implement the ability to work on a network, a separate copy of the program must be installed on each computer.

In this case, work with the same package on several computers can be organized. The system’s network tools have the ability to track the process of page processing - who is currently opening the page, scanning it, recognizing it, checking it, etc. Changes made to a page by one user are visible to everyone who works with the same package.

The CuneiForm 2000 Master system also has teamwork capabilities, which, in addition to the CuneiForm 2000 environment itself and a text editor, contains a built-in batch scanning and recognition software unit, as well as software for using scanners in local network.

Fourthly, this technologies for creating WEB documents.

HTML (HyperText Markup Language) is often used to create Web documents. This concept includes various ways design of hypertext documents, design, hypertext editors, browsers and much more. Hypertext is ideal for incorporating multimedia elements into traditional documents. In practice, it was thanks to the development of hypertext that most users were able to create their own multimedia products and distribute them on CDs. Such information systems, designed as sets of HTML pages, do not require the development of special software, since all necessary tools for working with data have become part of the standard software on most personal computers. With this approach, the user is required to perform only that work that directly relates to the topic of the product being developed: prepare texts, draw pictures, create HTML pages and think through the connections between them.

Topic 2.4. Database management systems and expert systems

2.4.11. Training database with the main button form "Training_students" - Download

Microsoft Word - word processor

2.1. Processing text information in Microsoft Word

2.1.2. Electronic document concept. Technology for creating and editing a text document in Microsoft Word

Electronic document concept

For effective management processes in enterprises and organizations require an effective system electronic document management.

Documents are the main information resource enterprises and organizations.

Document flow is a continuous process of movement of documents, which reflects the activities of enterprises and allows you to quickly manage production processes at the enterprise.

Currently, both traditional paperwork (on paper) and electronic document management are used. Electronic document management systems are designed to automate office processes.

A comprehensive solution for organizing electronic document management at an enterprise is provided by the electronic document management and office automation system “EVFRAT-Document Flow”. The office automation and electronic document management system DELO supports both traditional office management and electronic document management.

FossDoc - Electronic document management system. Electronic document management system - standard solution FossDoc platform, designed to automate document flow and office work, both in state and non-state enterprises of any size and type of activity.

The basic concept of an electronic document management system is an electronic document. An electronic document is a collection of data in computer memory, which includes text, pictures, tables, drawings, etc. and is intended to be perceived by humans using appropriate software and hardware.

The status of an electronic document is enshrined in the Law of Ukraine “On Electronic Documents and Electronic Document Flow” No. 851-IV dated May 22, 2003. An electronic document is a document in which information is recorded in the form of electronic data, including the mandatory details of the document.

Mandatory details of electronic documents include an electronic signature. Electronic digital signature (EDS) is used to sign electronic documents by both physical and legal entities in order to give the document legal force.

2.1.3. Technology for creating and editing a text document in Microsoft Word 2003

A text editor is a program that allows you to create a document with text data.

A word processor is a program for entering, editing and processing text documents with various information (for example, tables, graphics, etc.).

Text editors and processors include: built-in text editors; editors of instrumental computer programs; universal word processors; editors of scientific documents, programs for layout of magazines and books (publishing systems).

Currently, the most popular word processor is Microsoft Word 2003, but it has already appeared a new version Microsoft Office 2007.

In Word 2003, you can create the following types of documents:

  1. New document.
  2. Web page.
  3. XML document.
  4. Message E-mail.
  5. Message – Fax.
  6. Envelopes and Stickers.
  7. Templates.

Getting started with Word

All new text documents in Word are created based on templates. A template is a document that is used as a template for creating new text documents. After starting Word, a window appears on the screen showing a blank text document based on the Normal template.

By default, all text documents in Word are created based on the Normal Normal.dot template, and all text is entered in the Normal Normal style, in which the basic formatting parameters are set: font - Times New Roman, font size - 10 points, left alignment, single line spacing.

You can enter text into the empty text document window, insert a table, a picture, etc. The empty document is named Document 1, which is visible in the title bar.

You can create a text document in Word in the following ways:

  • select the New command from the File menu, then in the task area (the mode will change to “Create Document”) in the Create section, select “New Document” based on the Normal template or the Templates section in the On my computer item, select a predefined template or wizard based on which you need to create a document or template;
  • Clicking the New icon on the Standard toolbar opens a blank document, Document 1, based on the Standard template.

When creating new documents, they will be sequentially assigned numbers 2, 3, 4, 5, etc. In a blank document, you can see a flickering vertical line at the top left, which is called the input cursor. The cursor indicates where a character will be entered from the keyboard or an object (drawing, table, etc.) will be inserted.

The cursor position can be changed using the cursor control keys or the mouse manipulator only within the typed text. In the editor status line on the display screen you can see the document line number and the position number in it where the cursor is currently located.

Word mode - insert or replace

The Word word processor operates in insert or replace mode. The mode is indicated in the status line on the display screen. Modes can be switched using the Ins key. In the “insert” mode, characters entered from the keyboard shift the document text located behind the cursor to the right. In replacement mode, instead of the character located to the right of the cursor, a new character is entered from the keyboard.

Before entering text, it is advisable to set the parameters and the required page orientation by executing the command File / Page Setup, Margins tab.

Saving a Word document

While working with a document, it is in RAM. In order to write this file to disk, you must select the Save command from the File menu and set the necessary parameters in the Save Document dialog box.


Rice. 1.

When saving a document in the dialog box, you must: Specify the file name, select the File Type, select the folder in which the file attributes are saved, and the disk on which the file will be saved. Open file You can save it to disk under a different name by selecting "Save As" from the File menu.

Close a document and exit Word

To close a document rather than an application, you must select the "File/Close" command.

To exit the program, you can use one of the following methods:

  • select the Exit command from the File menu;
  • click the Close button on the title bar;
  • double-click the window menu button;
  • select the Close command in the window menu.

Open a document in Word

You can open a document in Word in several ways:

  1. Select the Open command from the File menu and in the Open Document dialog box, specify the name and location of the file;
  2. Click the Open button on the Standard toolbar and double-click the file icon in the Open Document dialog box.

To open several files at the same time, select their names in the dialog window, hold down the Ctrl key, and then click the Open button.

Multi-window mode in Word

Word processors can work simultaneously with multiple documents in different windows. Entering and editing text is carried out in the active window, in which you can access menu commands. Commands in the Window menu allow you to arrange document windows, move from one window to another, and divide the work area of ​​a window into two parts.

The main stages of Creating (preparing) text documents in Word are:

  • typing;
  • text editing;
  • formatting;
  • spell checking;
  • text printing;
  • preservation.

Each stage consists of performing certain operations.

You can enter text by typing it on the keyboard and insert various text fragments from other documents into the document.

Typing is carried out by pressing keys on the PC keyboard, while the next character is displayed on the screen at the cursor position, and the cursor moves one position to the right. Spaces that a text editor automatically inserts to align lines are called “soft” spaces.

“Hard” spaces are added to the text when you press the Space key on your PC keyboard. A sign of separation of a word from a word is a “Hard” space, so you must set one “Hard” space between words. A punctuation mark does not need to be separated by a space from the preceding word, and a space must be entered after a punctuation mark. A “hard” line terminator is created by pressing the Enter key to indicate the end of a paragraph.

Editing text in Word

After you enter text, it undergoes various changes. Editing a document in Word is carried out using menu commands or pressing keys on the PC keyboard.

Editing operations (delete, insert, move) are performed on:

  1. Symbols.
  2. In lines.
  3. In fragments.

Characters are entered from the keyboard (in insert or replace mode), and the Backspace or Delete keys are used to delete characters.

Editing operations for lines are: deleting a line, splitting one line into two, merging two lines into one, inserting an empty line.

Editing operations (copy, move and delete) for fragments. A fragment is a continuous piece of text. To delete, copy, or move a fragment, you need to select it.

You can select a fragment using the mouse or keyboard. Word uses different highlighting methods for words, lines, sentences, and paragraphs. (single, double or triple click on a paragraph or to the left of a paragraph in the selection bar). Selected text can be copied, moved and deleted either using the clipboard or by moving it with the mouse (while pressing the left or right key).

After selecting a fragment, the operation of copying or moving it can be performed in the following ways:

  • using the left mouse button (drag);
  • using the right mouse button (drag and drop);
  • using the Edit menu commands;
  • using context menu commands;
  • using icons on the toolbar (cut, copy, paste);
  • from the keyboard (set of keys).

To undo an erroneous action in Word, the rollback operation is used. The Undo command is found on the Edit menu and on the toolbar. In addition to the rollback command, there is a rollback cancel command.

Editing existing text in Word using the Edit/Replace command

The ability to search and replace text provided in Word significantly speeds up the process of editing (editing) large text. You can also use this command to find and replace specific formatting options, special characters, and other document objects.



Rice. 2.

The main source defining the features of working with electronic documents in federal executive authorities is the “Rules for office work in federal executive authorities”, section V1 (approved by Government Resolution Russian Federation dated June 15, 2009 No. 477).

The conditions defining the features of working with electronic documents in federal executive authorities should be reflected in the Instructions for Office Work federal body executive power.

The Instructions for Office Work in the Federal Executive Body establishes the procedure for creating, receiving, processing, storing and using electronic documents in the context of using a departmental electronic document management system, as well as the procedure for sending, receiving and registering electronic messages.

Reception of electronic documents from other government bodies and organizations and sending of electronic documents is carried out by the Office Management Service of the federal executive body.

Electronic documents are created, processed and stored in the electronic document management system of the federal executive body. Documents created by the federal executive body and received by the federal executive body on paper are included in the electronic document management system after scanning and creating electronic images of the documents.

The inclusion of an electronic image of a document in the electronic document management system, possibly after its verification (comparing the electronic image of a document with the original document), confirmation of compliance of the electronic image with the original document is carried out using electronic signature employee of the Office Management Service performing this procedure.

Electronic messages received through the interdepartmental electronic document management system are registered in the electronic document management system of the federal executive body in the manner established by the Instructions for Office Management. The electronic document management system of the federal executive body must provide for registration (accounting) and inclusion in the system of electronic messages sent and received through the interdepartmental electronic document management system.



When uploading an electronic message into the electronic document management system of a federal executive body, the details contained in the corresponding XML file of the electronic message can be used to fill out the fields of the registration and control card of the received document in electronic form.

The instructions for office work should stipulate that the preparation, execution and approval of draft electronic documents is carried out according to general rules office work established in relation to similar documents on paper. An electronic document must have the details established for a similar document on paper, with the exception of a seal impression.

To confirm the authenticity of electronic documents, the federal executive body uses electronic digital signatures.

The electronic digital signature tools used must be certified in the prescribed manner.

When receiving electronic documents from other authorities or organizations, the Office Management Service of the federal executive body verifies the authenticity of the electronic digital signature.

When reviewing and agreeing on electronic documents, as well as when signing internal information and reference documents (reports, memos, certificates, summaries, etc.) created in electronic form, the electronic document management system of the federal executive body can use methods for confirming actions with electronic documents in which an electronic digital signature is not used. Such rules can be established provided that the software used in a given federal executive body makes it possible to unambiguously identify the person who signed or endorsed the document.

Mandatory information about the electronic document

In order to record and search for documents in the electronic document management system of the federal executive body, in accordance with the Rules of office work in the federal executive bodies, mandatory information about the document is used, included in the registration and control card in accordance with the table below.

Name of information about the document Characteristics of information included in the registration and control card
1. Addressee Full official and abbreviated name of the organization or surname, initials of the person sending the document (based on the document form or in accordance with the data specified in the citizen’s application)
2. Addressee Full official and abbreviated name of the organization or surname, initials of the person receiving the document (in accordance with the “Addressee” requisite)
3. Position, surname and initials of the person signing the document In accordance with the “Signature” requisite
4. Document type In accordance with the indication of the type of document on the document form or based on an assessment of the content of the document
5. Document date In accordance with the date indicated in the document by the author or based on the postmark on the envelope, if there is no date in the document
6. Document number In accordance with the number assigned to the document by the author
7. Date of receipt of the document The date indicated in the receipt mark (incoming stamp)
8. Incoming document number Number affixed to the receipt mark (incoming stamp)
9. Link to the outgoing number and date of the document In accordance with the information specified in the relevant details
10. Title of the text Summary document (heading to text)
11. Case index Case index according to the nomenclature of cases, indicating the location of the document storage
12. Information about document forwarding Based on the resolution on the document (the “Instructions for execution” detail)
13. Number of sheets of the main document Number of sheets of the main document
14. Number of applications Number of applications
15. Total number of application sheets Total number of application sheets
16. Instructions for executing the document A resolution, or instruction from a manager, determining the nature of the execution of the document and the deadline for execution
17. Position, surname and initials of the performer Position, surname and initials of the performer
18. Privacy Notice In accordance with the “Document access restriction stamp” (“For official use”, “Confidential”, “Trade secret”, etc.)

The instructions for office work may provide for the inclusion of additional information about documents in the electronic document management system of the federal executive body. Additional information about an electronic document may include information: the name of the author of the document (if the author does not coincide with the addressee (sender) of the electronic document), a note on the postponement of the document's execution date, the document's shelf life, names of attachments to the electronic document, keywords, etc. .

Topic 29. Technologies for creating electronic documents

Today it is no longer surprising when a reader (user), coming to a library and spending a certain amount of time in it, not only does not use its collection, but does not even make an attempt to order literature. Users satisfy many of their needs at the information level, i.e. are quite satisfied with bibliographic, abstract, reference and other electronic information received via the Internet, using CD-ROM or other means. Moreover, the development of electronic resources leads to the fact that many libraries, not having the funds to acquire foreign publications, find the opportunity to organize access for their users to full-text electronic resources, and these trends are gaining momentum (electronic scientific library of the Russian Foundation for Basic Research, electronic publications of the Open Society Institute " – Russia and others have provided free access to many libraries to several thousand titles of foreign journals). In addition, electronic delivery of documents is becoming increasingly widespread, which allows library users, without coming to the library itself, to order an electronic copy of an article, brochure, or fragment of a book from their workplace (or home) and receive it in a matter of moments. At the same time, it should be noted that already in many libraries the number of remote accesses (visits) to the library’s Internet site exceeds the number of regular physical visits.
All of the above confirms that old ideas about the library are crumbling before our eyes. The reader no longer always comes to the library for a document, he comes for information and/or an electronic resource. The library is gradually turning into a center of information electronic resources, while retaining, nevertheless, the functions of a regular library for serving users with printed documents. This new function of the modern library not only clearly illustrates one of the main trends in modern librarianship, it should also be taken into account both in planning the development of automated library and information technologies, and in general in revising the already outdated concept of the traditional library.

Technologies for creating an electronic document

Firstly, this technologies of analytical and synthetic information processing in ALIS.
Analytical-synthetic processing (ASP) are separate processes that include operations or groups of operations associated with the analysis of documents or data but do not lead to a significant change in their content, structure and appearance. In automated systems, ASO can be performed both in the form of manual, “human-machine”, and purely “machine” (i.e. using only automated means) operations. Examples of fully or partially automated implementation of ASO include operations of sorting, recording/rewriting, encoding, decoding, encryption/decryption, conversion, etc. Data processing is a generalized name for heterogeneous processes related to data. You can note some terms associated with types of data processing:

  • Integrated data processing is the principle of organizing data processing in an automated system, in which processes or operations previously performed in various organizations, divisions or sections of the technological chain are combined or optimized in order to increase the efficiency of the system. One possible goal of "integrated data processing" is the creation of integrated databases.
  • Distributed data processing is data processing carried out in a distributed system, in which each of the technological or functional nodes of the system can independently process local data and make appropriate decisions. When executing individual processes, nodes of a distributed system can exchange information through communication channels in order to process data or obtain analysis results that are mutual to them.
  • Automated processing (data/documents) – processing (data or documents) performed automatically, without or with limited human intervention. The technical means for implementing “automated processing” can be computers or other devices or machines.
  • Machine processing - performing operations on data using a computer or other data processing devices.
  • Pre-machine processing, preparation of data for input - a stage of analytical and synthetic processing or document processing associated with the formalization of the final documents and recording their content on a worksheet.
  • Sorting is the automatic or manual distribution of documents or data according to any specified criteria. A set of processes associated with bringing records in a file into compliance with the latest changes in the subject area or new information (data) received. "Updating a file" involves performing the following operations: viewing records, adding new records, erasing (deleting) or correcting (editing) existing records.

Secondly, this information digitization technologies.
Information digitization technologies include:

1.Scanning.
This method is especially convenient for those who prepare multimedia applications based on existing printed materials and illustrations.
A scanner is a peripheral device of a personal computer that allows you to enter originals into the computer, presented in the form of text documents, drawings, slides, photographs, and so on. The scanner converts such information into electronic information.
Based on a general assessment of the characteristics of scanners, these devices can be divided into the following subgroups:

o Handheld scanners.

o Page (broached) scanners.

o Slide – scanners.

o Projection scanners.

o Drum scanners.

2.Entering data from the keyboard.

Keyboard data entry is a type of manual entry. A technological means of providing keyboard input is a worksheet. Text entry is carried out using the Word text editor, which is included in the software package under the general name Office, released by Microsoft Corporation. It is designed to run under the Windows operating system.
Word is a Windows application designed for creating, viewing, modifying and printing text documents, which makes it possible to perform all traditional operations on text, without exception, provided by modern computer technologies:

o typing and modification of unformatted alphanumeric information;

o formatting characters using a variety of True Type fonts of various styles and sizes;

o page formatting (including headers and footers);

o formatting the document as a whole (automatic creation of a table of contents and various indexes);

o spell checking, selection of synonyms and automatic word hyphenation.

Thirdly, this technologies for processing digitized documents.
Technologies for processing digitized documents include character recognition and their further editing.
Optical character recognition software - OCR (Optical Character Recognition) is designed for automatically entering printed documents into a computer. Most often, users use in practice the FineReader optical text recognition system produced by ABBYY and the CuneiForm optical character recognition system produced by Cognitive Technologies Ltd. Both systems have approximately equal capabilities and are designed to recognize texts written in almost any font (except hieroglyphs and Arabic), without prior training. A special feature of the programs is high accuracy of character recognition and low sensitivity to printing defects.
OCR software systems work as follows. Let's say you have a text paper document with a complex structure, i.e. In addition to text, the document contains tables, diagrams, illustrations, etc. You need to edit the text using a word processor. To solve this problem, you need to place a document in a scanner and create an electronic copy of it, which is a graphic image of the document.
Next you need to convert the image to text. This stage is very important, since the result of scanning is an image that can be saved exclusively in a file of one or another graphic format or loaded for processing not into a text editor, but into some kind of graphic editor. You can paste an image directly into a text editor. However, for text applications, the image is an indivisible element and cannot be identified. Thus, even if there is text on the image, it cannot be edited using a text editor. Therefore, first of all, it is necessary to convert the image of characters into text as such, i.e. into a sequence of characters that can be processed in text editors.
The FineReader and CuneiForm software systems allow you to solve problems of converting images into texts and are supplied in the form of complexes that are powerful enough for practical use.
The process of text recognition by OCR systems is as follows.
First, you need to get a graphic image of the document, which can be done in two ways - by scanning the document or loading an image from a file. The application software environment allows you to view images at various scales, as well as perform some transformations with them, in particular, rotate and invert them.
At the next stage of document processing, they are marked. The purpose of this operation is to tell the OCR system how text is positioned in the image.
The results of automatic marking can be adjusted manually - create and delete blocks of text, move them, set their sizes, divide them into adjacent blocks, make blocks polygonal by attaching block rectangles to them or cutting them out, etc. Blocks can be assigned serial numbers, which is very convenient if you need to convert complexly formatted text into simple text.
For texts printed blurry or on a non-uniform background, adaptive scanning is used, which allows, at the expense of some reduction in productivity, to increase the accuracy of determining the outlines of letters.
When recognizing low-contrast, faintly printed documents, the recognition quality can be improved by adjusting parameters such as brightness, contrast, and black and white point thresholds.
Markup and OCR settings are configured to ensure that the OCR system is able to correctly block and recognize text. In particular, you need to indicate in which language(s) the text to be recognized is written.
To quickly work with recognized text, the OCR system integrates its own text editor, reminiscent of the WordPad application built into Windows. The text editor is capable of supporting such basic text formatting features as fonts and styles, superscripts and subscripts, tables, columns, and frames that can be moved on top of the text. In the recognized text, dubious words are highlighted with a certain background, and the text editor provides tools for quickly searching for dubious words, which greatly simplifies viewing and editing the recognized text.
Some versions of OCR systems, for example ABBYY FineReader Corporate Edition, contain integrated tools for teamwork. To implement the ability to work on a network, a separate copy of the program must be installed on each computer.
In this case, work with the same package on several computers can be organized. The system’s network tools have the ability to track the process of page processing - who is currently opening the page, scanning it, recognizing it, checking it, etc. Changes made to a page by one user are visible to everyone who works with the same package.
The CuneiForm 2000 Master system also has teamwork capabilities, which, in addition to the CuneiForm 2000 environment itself and a text editor, contains a built-in batch scanning and recognition software unit, as well as software for using scanners on a local network.

Fourthly, this technologies for creating WEB documents.
HTML (HyperText Markup Language) is often used to create Web documents. This concept includes various ways of designing hypertext documents, design, hypertext editors, browsers and much more. Hypertext is ideal for incorporating multimedia elements into traditional documents. In practice, it was thanks to the development of hypertext that most users were able to create their own multimedia products and distribute them on CDs. Such information systems, designed as sets of HTML pages, do not require the development of special software, since all the necessary tools for working with data have become part of the standard software of most personal computers. With this approach, the user is required to perform only that work that is directly related to the topic of the product being developed: prepare texts, draw pictures, create HTML pages and think through the connections between them.

Electronic publications and electronic libraries
Computer technology makes it possible to store and distribute information in electronic form, which plays a revolutionary role in the history of mankind, similar to the invention of printing. Electronic form allows you to store information most reliably and compactly, distribute it more quickly and widely and, in addition, provides opportunities for its use that could not be available in other forms. The main means for realizing these opportunities are electronic publications (EP) and electronic electronic libraries (EL).
Naturally, in order to perform their functions, EI and EB must actually be implemented in the form of some information systems whose distinctive properties are long-term storage and use of information, in particular its dissemination. Information systems that work with dynamic information, as well as accumulate information but do not intend to distribute it (archive-type systems), cannot be classified in this class. Thus, EI and EB occupy a certain position among other information systems.
Electronic edition– an electronic document (group of documents) that has undergone editorial and publishing processing, intended for distribution in unchanged form, with output information.
EI consists of information and software that allows you to visualize this information, use it effectively, but not change it. Thus, EI is an information system in which there are no functions for correcting existing information (deleting, adding, etc.). The main purpose of the EI information system is to reproduce the original information as accurately as possible and provide the user with the means to solve a certain set of functional tasks.
EI are replicable products and can be distributed on any machine-readable media (compact optical disks, magnetic floppy disks, etc.) or through telecommunication networks.
It should be borne in mind that it extends Information system, not just information. If the information is used using other software, then some of it may be lost, all its properties will not be realized, and the range of user capabilities may be significantly reduced.
The field of activity related to digital libraries is quite new and therefore does not yet have a stable terminology. (Of course, it should not be identified with the automation of traditional library processes, although it is hardly possible to draw an exact boundary between them.)
Work on digital libraries received significant development at the turn of the 1990s, when adequate computer technology and information Technology, ensuring reliable storage, prompt processing and efficient use large arrays of heterogeneous information, primarily textual. It was at that time that projects began to be prepared in a number of countries electronic libraries. Some of them formed the basis of relevant national and international programs.