Howto convert a webpage to readable text
From Wikihowto
You can change a webpage to a readable text file
[edit] Web based
[edit] Command line
You can accomplish this by using lynx
- Install lynx, if its not installed already
- http://lynx.isc.org/
- Gentoo: emerge lynx
- Grab the text
- Execute:
- lynx http://www.webpage.org -dump > output-file.txt
- Example:
- lynx http://www.gentoo.org/doc/en/handbook/handbook-x86.xml?full=1 -dump >handbook-x86.txt
Screenshot of Website used for the following example [1]
- Example output
#[1]Gentoo Website [2]Gentoo Forums [3]Gentoo Bugzilla [4]Gentoo
Packages [5]Gentoo List Archives
[6]Gentoo Logo
Gentoo Linux x86 Handbook
Content:
* [7]Installing Gentoo
In this part you learn how to install Gentoo on your system.
1. [8]About the Gentoo Linux Installation
This chapter introduces you to the installation approach
documented in this handbook.
2. [9]Choosing the Right Installation Medium
You can install Gentoo in many ways. This chapter explains how
to install Gentoo using the minimal Installation CD although
installation through the Installer LiveCD is possible as well.
3. [10]Configuring your Network
To be able to download the latest source code, you will need
to setup networking.
4. [11]Preparing the Disks
To be able to install Gentoo, you must create the necessary
partitions. This chapter describes how to partition a disk for
[edit] Graphical
to be added
