Knowledge discovery / Symbolic computation

URI permanente para esta colección:

Artículos en la categoría Knowledge discovery / Symbolic computation publicados en las Actas de las XX Jornadas de Programación y Lenguajes (PROLE 2021).
Notificar un error en esta colección

Examinar

Envíos recientes

Mostrando 1 - 3 de 3
  • Artículo
    Variant-based Equational Unification under Constructor Symbols
    Aparicio Sanchez, Damian; Escobar, Santiago; Sapiña, Julia. Actas de las XX Jornadas de Programación y Lenguajes (PROLE 2021), 2021-09-22.
    Publicado en The 36th International Conference on Logic Programming (ICLP 2020).
  • Artículo
    An Optimizing Protocol Transformation for Constructor Finite Variant Theories in Maude-NPA
    Aparicio Sanchez, Damian; Escobar, Santiago; Gutiérrez, Raúl; Sapiña, Julia. Actas de las XX Jornadas de Programación y Lenguajes (PROLE 2021), 2021-09-22.
    Publicado en 25th European Symposium on Research in Computer Security (ESORICS 2020).
  • Artículo
    Page-Level Main Content Extraction from Heterogeneous Webpages
    Alarte, Julián; Sílva, Josep. Actas de las XX Jornadas de Programación y Lenguajes (PROLE 2021), 2021-09-22.
    The main content of a webpage is often surrounded by other boilerplate elements related to the template, such as menus, advertisements, copyright notices, comments, etc. For crawlers and indexers, isolating the main content from the template and other noisy information is an essential task, because processing and storing noisy information produce a waste of resources such as bandwidth, storage space, computing time, etc. Besides, the detection and extraction of the main content is useful in different areas, such as data mining, web summarization, content adaptation to low resolutions, etc. This work introduces a new technique for main content extraction. In contrast to most techniques, this technique not only extracts text, but also other types of content, such as images, animations, etc. It is a DOM-based page-level technique, thus it only needs to load one single webpage to extract the main content. As a consequence, it is efficient enough as to be used online (in real-time). We have empirically evaluated the technique using a suite of real heterogeneous benchmarks producing very good results compared with other well-known content extraction techniques. Publicado en: ACM Transactions on Knowledge Discovery from Data. Año 2021