Newser is a simple utility to generate a pdf with you favorite news articles

Last update: Nov 9, 2022

Comments: 0

Newser

A simple utility to crawl some news sites or other resources and download content into a pdf

Building

Make sure you have config.yaml setup and go available, then run go build cmd/newser.go or just run it from source with go run cmd/newser.go

Configuration

Configuration file is used to guide the pdf building process, right now only website parsing is supported.

The configuration file must have a top level defs (definitions), font and output properties. Right now defs must have a website property that contains website definitions.

Default config is part of the source repo.

Website Definitions

-   index: "index-page-url"
    indexSelector: "css-selector-for-articles-index"
    titleSelector: "title-selector-for-articles"
    linkSelector: "selector-for-the-link-for-the-article-content"
    linkAttr: "attribute-to-gather-from-link-selector"
    articleContainerSelector: "article-container-selector"
    articleContentSelector: "article-content-selector"
    ignoreString: "if-found-in-article-article-will-be-ignored"
    removeElems:
        - "selector-in-article-html-to-remove"
        - "someother-selector-in-article-html-to-remove"
    collectOnly: 0 # 0 if you want to collect all articles, or limit to N articles
    disable: 0 # 1 if you want to disable this entry

The good thing is you can be as specific with selectors as you want. So if a website has multiple sections that contain articles, you can have multiple definitions for it and only get the articles that you want.

Deps

Top level deps are

fpdf - "github.com/go-pdf/fpdf" - For generating pdfs
yaml - "gopkg.in/yaml.v2" - For parsing yamls
colly - "github.com/gocolly/colly/v2" - For crawling websites

Contributing

Right now the project is still pretty much done for my desire to read news on my Supernote (awesome gadget btw) so if you wanna do something clever just create a PR.

Contributors

lnenad

Licence

Licence is free for personal but paid for commercial, get in touch if you want to use the utility or code for commercial purposes.

Owner

Nenad

Speaking and typing many languages. Alternate company profile @nenadlukic

https://github.com/lnenad/newser

Newser is a simple utility to generate a pdf with you favorite news articles

Newser

Building

Configuration

Website Definitions

Deps

Contributing

Contributors

Licence

Owner

Nenad

Similar Resources

PDF file parser

create PDF from ASCII File for Cable labels

Convert document to pdf with golang

Ghostinthepdf - This is a small tool that helps to embed a PostScript file into a PDF

Read data from rss, convert in pdf and send to kindle. Amazon automatically convert them in azw3.

Go-wk - PDF Generation API with wkhtmltopdf

PDF Annotator of Nightmares 🎃

A simple utility for validating CSV columns

🌳 📂 The utility displays a tree of directories and files(symlinks in future).

Related tags

goldmark-pdf is a renderer for goldmark that allows rendering to PDF.

Starter files for the News application built with Go

QueryCSV enables you to load CSV files and manipulate them using SQL queries then after you finish you can export the new values to a CSV file

A simple library for generating PDF written in Go lang

Golang wrapper for Exiftool : extract as much metadata as possible (EXIF, ...) from files (pictures, pdf, office documents, ...)

A PDF processor written in Go.

A PDF document generator with high level support for text, drawing and images

PDF tools for reMarkable tablets

A command line tool for mainly exporting logbook records from Google Spreadsheet to PDF file in EASA format

A Docker-powered stateless API for PDF files.