Update README
parent f9a80d3dac
commit c51cf9738f
36 README.md
@@ -30,35 +30,26 @@ Otherwise, you can use the [pre-built binaries](https://github.com/FooSoft/scraw
 Executing Scrawl with the `-help` command line argument will trigger online help to be displayed. Below is a more
 detailed description of what the parameters do.
 
-* **attr**: The attribute containing the desired download path is specified by this argument.
-* **dir**: This argument specifies the output directory for downloaded files.
-* **vebose**: Scrawl will output more details about what it is currently doing when this flag is set.
+* `attr`: The attribute containing the desired download path is specified by this argument.
+* `dir`: This argument specifies the output directory for downloaded files.
+* `vebose`: Scrawl will output more details about what it is currently doing when this flag is set.
 
 ## Example
 
 Let's say we want to create a script to download the latest Debian package of [Anki](http://ankisrs.net/):
 
-1. We load up the homepage and are presented with a big download button as shown in the screenshot below:
-
-[![Anki Homepage](img/anki-thumb.png)](img/anki.png)
-
-2. Let's copy that link so we can download the latest version with wGet or curl from our script at any time!
-
-Hmm, it looks like the path `http://ankisrs.net/download/mirror/anki-2.0.33.deb` has the version number embedded in
-the filename. This means that even after a new version of Anki is released, our script will keep getting version
+1. We load up the homepage and are presented with a big download button as shown in the screenshot below: \
+![](img/anki.png)
+2. Let's copy that link so we can download the latest version with wGet or curl from our script at any time! Hmm, it
+looks like the path `http://ankisrs.net/download/mirror/anki-2.0.33.deb` has the version number embedded in the
+filename. This means that even after a new version of Anki is released, our script will keep getting version
 `2.0.33` (unless of course it gets deleted).
-
-3. Let's inspect the download link in your favorite browser to see what additional information we can get:
-
-[![Inspector](img/inspect-thumb.png)](img/inspect.png)
-
-4. It appears that we can easily create a selector for this element: `#linux > a:nth-child(2)`.
-
-Note that [Chrome](https://www.google.com/chrome/) provides the option to copy the CSS selector for any element,
-making knowledge of web technology optional for this step.
-
+3. Let's inspect the download link in your favorite browser to see what additional information we can get: \
+![](img/inspect.png)
+4. It appears that we can easily create a selector for this element: `#linux > a:nth-child(2)`. Note that
+[Chrome](https://www.google.com/chrome/) provides the option to copy the CSS selector for any element, making
+knowledge of web technology optional for this step.
 5. Now let's create a simple download and install script:
-
 ```bash
 #!/bin/sh
 rm -rf /tmp/anki
@@ -67,7 +58,6 @@ Let's say we want to create a script to download the latest Debian package of [A
 sudo dpkg -i /tmp/anki/*.deb
 sudo apt-get install -y -f
 ```
-
 In this script, we prepare an empty download directory and tell Scrawl to scrape `http://ankisrs.net/`, extracting
 the `href` property of the download link identified by the CSS selector `#linux > a:nth-child(2)`. We then install
 the package and bring in any unsatisfied dependencies.
Binary file not shown.
Before Width: | Height: | Size: 4.3 KiB |
Binary file not shown.
Before Width: | Height: | Size: 6.0 KiB |
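
Step 2 of the README example observes that the download path has the version number embedded in the filename. As a rough illustration of that point, here is a minimal sketch that pulls the version out of such a URL using only POSIX parameter expansion. The helper name `deb_version_from_url` is hypothetical and not part of Scrawl, and the sketch assumes the filename has the form `<name>-<version>.deb`, as in the URL quoted in the README.

```bash
#!/bin/sh
# Hypothetical helper (not part of Scrawl): print the version embedded in a
# package URL whose filename looks like <name>-<version>.deb (an assumption).
deb_version_from_url() {
    file=${1##*/}                 # drop everything up to the last slash
    base=${file%.deb}             # drop the .deb suffix
    printf '%s\n' "${base##*-}"   # keep what follows the last hyphen
}

# The URL quoted in step 2 of the README example.
deb_version_from_url 'http://ankisrs.net/download/mirror/anki-2.0.33.deb'
```

A hard-coded link always resolves to the same version string, which is why the example scrapes the download link from the homepage instead of storing the URL.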