Skip to content
Change the repository type filter

All

    Repositories list

    54 repositories

    • devops tools
      Shell
      Apache License 2.0
      0000Updated Jul 26, 2024Jul 26, 2024
    • sous-chef

      Public
      Configurable Data Analytics Pipeline
      Python
      0180Updated Jul 25, 2024Jul 25, 2024
    • Internal API server that offers search access to the Media Cloud Online News Archive (in Elasticsearch).
      Python
      GNU Affero General Public License v3.0
      3161Updated Jul 24, 2024Jul 24, 2024
    • Internal library to allow querying multiple media platforms with a consistent API.
      Python
      1030Updated Jul 24, 2024Jul 24, 2024
    • The core pipeline used to ingest online news stories in the Media Cloud archive.
      Python
      Apache License 2.0
      41312Updated Jul 24, 2024Jul 24, 2024
    • Code that drives the public web-based tools for the Media Cloud Online News Archive and Directory.
      JavaScript
      Apache License 2.0
      129361Updated Jul 23, 2024Jul 23, 2024
    • sc-buffet

      Public
      Sous-chef buffet - Self-service data access for sous-chef.
      Python
      0050Updated Jul 22, 2024Jul 22, 2024
    • Daily performance metrics for the mediacloud application
      Python
      0010Updated Jul 5, 2024Jul 5, 2024
    • mc-manage

      Public
      Python
      0000Updated Jun 25, 2024Jun 25, 2024
    • An internal client library to access the new Mediacloud news archive search.
      Python
      Apache License 2.0
      2031Updated Jun 14, 2024Jun 14, 2024
    • Intelligently fetch lists of URLs from a large collection of RSS Feeds as part of the Media Cloud Directory.
      Python
      Apache License 2.0
      55141Updated Jun 2, 2024Jun 2, 2024
    • A Python client for the CLIFF geoparsing tool
      Python
      MIT License
      5501Updated May 21, 2024May 21, 2024
    • Public client for consuming content from the Media Cloud Online News Archive & Directory.
      Python
      Apache License 2.0
      246811Updated May 7, 2024May 7, 2024
    • How Media Cloud approaches extracting metadata from online news stories
      Python
      Apache License 2.0
      31130Updated May 7, 2024May 7, 2024
    • A client library to access the Wayback Machine news archive search.
      Python
      Apache License 2.0
      2410Updated Dec 15, 2023Dec 15, 2023
    • web-tools

      Public archive
      The shared repository for Media Cloud web apps (Explorer, Source Manager, Topic Mapper)
      JavaScript
      Apache License 2.0
      3063314Updated Dec 14, 2023Dec 14, 2023
    • A set of jupyter notebooks demonstrating how to use the Media Cloud API.
      Jupyter Notebook
      143300Updated Dec 13, 2023Dec 13, 2023
    • backend

      Public archive
      Media Cloud is an open source, open data platform that allows researchers to answer quantitative questions about the content of online media.
      Python
      GNU Affero General Public License v3.0
      8727713125Updated Nov 20, 2023Nov 20, 2023
    • Dokku app that serves a static HTML catch-all page, displayed for bad domains
      HTML
      0000Updated Oct 25, 2023Oct 25, 2023
    • A simple homepage for the CLIFF project
      HTML
      MIT License
      1100Updated May 30, 2023May 30, 2023
    • Ultimate Website Sitemap Parser
      Python
      Other
      64174184Updated May 17, 2023May 17, 2023
    • Tag news stories based on models trained on the NYT corpus.
      Python
      Apache License 2.0
      123916Updated Mar 1, 2023Mar 1, 2023
    • Find rss, atom, xml, and rdf feeds on webpages
      Python
      MIT License
      123141Updated Feb 27, 2023Feb 27, 2023
    • Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
      Python
      Other
      3022322Updated Nov 7, 2022Nov 7, 2022
    • glimpse

      Public archive
      Get a glimpse of attention to a topic on social media.
      Python
      Apache License 2.0
      2280Updated Sep 19, 2022Sep 19, 2022
    • Helpful micro-service to return results from word2vec models
      Python
      MIT License
      4200Updated Jul 29, 2022Jul 29, 2022
    • cliff-annotator

      Public archive
      A lightweight server to allow HTTP requests to the Stanford Named Entity Recognized and a heavily modified CLAVIN geoparser.
      Java
      Apache License 2.0
      34119910Updated May 20, 2022May 20, 2022
    • Notebook demonstrating how to create and update a Media Cloud collection.
      Jupyter Notebook
      0000Updated Mar 30, 2022Mar 30, 2022
    • Temporal server configuration
      0000Updated Jan 4, 2022Jan 4, 2022
    • PostgreSQL built for AWS Graviton2
      1200Updated Dec 30, 2021Dec 30, 2021