Implementación del framework Apache Flink para el procesamiento de grandes cantidades de datos en tiempo real

Solano Rivera, Elder Fidel

Please use this identifier to cite or link to this item: http://dspace.utpl.edu.ec/handle/20.500.11962/26651

Title:	Implementación del framework Apache Flink para el procesamiento de grandes cantidades de datos en tiempo real
Authors:	Solano Rivera, Elder Fidel
Director:	Elizalde Solano, René Rolando
Keywords:	Ecuador. Tesis digital.
Issue Date:	2020
Citation:	Solano Rivera, E. F. Elizalde Solano, R. R. (2020) Implementación del framework Apache Flink para el procesamiento de grandes cantidades de datos en tiempo real [Tesis de Grado, Universidad Técnica Particular de Loja]. Repositorio Institucional. https://dspace.utpl.edu.ec/handle/20.500.11962/26651
Abstract:	Abstract: In the current era there is the production of large amounts of information that are from different sources such as: banks, entities, businesses, web pages, social networks, among others, with social networks being the ones that produce the greatest volume of information. The destination of these large volumes of information is storage and backup, causing that there is no adequate use for the processing and extraction of information quickly and reliably. In this degree work, the implementation of the Apache Flink Framework is carried out, which integrates the DataStream API which allows the processing of data flows in real time, using operators and functions of this API to comply with this type of processing and show results automatically. The implementation of the operating environment of this tool is carried out in a single node or host, and through the use of the different test scenarios proposed in this project, it is possible to determine that Apache Flink performs the processing of data flows in a efficient and meets planned expectations.
Description:	Resumen: En la época actual existe la producción de grandes cantidades de información que son procedentes de diferentes fuentes como: bancos, entidades, negocios, páginas web, redes sociales, entro otros, siendo las redes sociales las que mayor volumen de informaciónproducen. El destino de estos grandes volúmenes de información es el almacenamiento y respaldo, provocandoque no exista un uso adecuado para el procesamiento y extracción de información de manera rápida y fiable. En el presente trabajo de titulación se realiza la implementación del Framework Apache Flink, que integra elAPI DataStream la cualpermite realizar el procesamiento de flujos de datos en tiempo real, utilizando operadores y funciones propias de esta API para cumplir con este tipo de procesamiento y mostrar resultados de forma automática. La implementación del entorno de operación de esta herramientase la realiza en un solo nodo o host, y mediante el uso de los diferentes escenarios de prueba planteados en el presente proyecto, se logradeterminar que Apache Flink realiza el procesamiento de flujos de datos de manera eficiente y cumple con las expectativas planificadas.
Identifier :	Cobarc: 1345230
URI:	https://bibliotecautpl.utpl.edu.ec/cgi-bin/abnetclwo?ACC=DOSEARCH&xsqf99=124188.TITN.
Type:	bachelorThesis
Appears in Collections:	Ingeniero en Sistemas Informáticos y Computación

Files in This Item:

f75a082c-8818-40b5-90ff-ebbd39bfd48a

Show full item record