Implementación de un sistema predictivo para grandes volúmenes de datos bajo tecnologías Big Data

Contento Ortega, Freddy Iván

Please use this identifier to cite or link to this item: http://dspace.utpl.edu.ec/handle/20.500.11962/23632

Title:	Implementación de un sistema predictivo para grandes volúmenes de datos bajo tecnologías Big Data
Authors:	Contento Ortega, Freddy Iván
Director:	Tenesaca Luna, Gladys Alicia
Keywords:	Ecuador. Tesis digital.
Issue Date:	2018
Citation:	Contento Ortega, F. I. Tenesaca Luna, G. A. (2018) Implementación de un sistema predictivo para grandes volúmenes de datos bajo tecnologías Big Data [Tesis de Grado, Universidad Técnica Particular de Loja]. Repositorio Institucional. https://dspace.utpl.edu.ec/handle/20.500.11962/23632
Abstract:	Abstract:The present project is oriented at implementing a Recommendation Web Service based on collaborative filtering for large volumes of data on Big Data platforms that allows the rapid processing of information and supports decision making. Currently on the Web, there are few automated processes to dynamically serve personalized content of products based on the preferences and interests of users and to facilitate the decision to choose one or the other product. We experimented with the MovieLens data set, the implementation of the ALS collaborative filtering algorithm of Apache Spark MLlib to make online movie recommendations using a RESTful API with Flask and, Hadoop HDFS. To evaluate the behavior of the algorithm load tests are performed to the web service obtaining recommendations of top movies, individual ratings predicted for a certain movie by adding ratings to movies by users. The results show how he is able to make recommendations online providing quick answers, but he faces the problem of cold start for new users with times of 2, 25 and 95 min.
Description:	Resumen:El presente trabajo de titulación está orientado a implementar un Servicio Web de Recomendación basado en filtrado colaborativo para grandes volúmenes de datos sobre plataformas Big Data que permita el rápido procesamiento de la información y apoye a la toma de decisiones. Actualmente en la Web, existen pocos procesos automatizados para servir dinámicamente contenido personalizado de productos en base a las preferencias e intereses de los usuarios y que facilite la decisión de elegir uno u otro producto. Se experimenta con el conjunto de datos MovieLens, la implementación del algoritmo de filtrado colaborativo ALS de Apache Spark MLlib para realizar recomendaciones de películas en línea mediante una API RESTful con Flask y, Hadoop HDFS. Para evaluar el comportamiento del algoritmo se realizan pruebas de carga al servicio web obteniendo recomendaciones de películas top, calificaciones individuales predichas para cierta película agregando calificaciones a películas por usuarios. Los resultados muestran como es capaz de realizar recomendaciones en línea proporcionando respuestas rápidas, pero se enfrenta al problema del arranque en frío para usuarios nuevos con tiempos de 2, 25 y 95 min.
Identifier :	Cobarc: 1286404
URI:	https://bibliotecautpl.utpl.edu.ec/cgi-bin/abnetclwo?ACC=DOSEARCH&xsqf99=120603.TITN.
Type:	bachelorThesis
Appears in Collections:	Ingeniero en Sistemas Informáticos y Computación

Files in This Item:

8b6d51d6-07ae-4e2b-b7d1-60bea359d5df

Show full item record