Commit Graph

281 Commits

Author SHA1 Message Date
stephenmk
96358e3eb5
Fix function parameter
Sense numbers start at 1, not 0
2023-01-24 08:55:24 -06:00
stephenmk
ef1e74447d
Include term tags and scores in standalone forms dictionary 2023-01-23 23:52:42 -06:00
stephenmk
d606f729cf
Use secondary frequency tags in term score calculation
If a term has a frequency tag, it should return higher in search
results than a match which does not have a tag.

For example, a search for 素性 should return すじょう rather than
そせい, because the former has a "news" frequency tag.
2023-01-23 14:13:22 -06:00
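
The ranking rule described in this commit message could be sketched roughly as follows. This is a minimal illustration, not the repository's actual score calculation: the `candidate` type, its field names, and the bonus value of 1 are all assumptions made for the example.

```go
package main

import "fmt"

// candidate is a hypothetical reading for a headword; the type and its
// field names are illustrative and not taken from the repository.
type candidate struct {
	Reading  string
	FreqTags []string // e.g. "news"; empty when the entry has no frequency tag
}

// score gives a bonus to any candidate that carries a frequency tag.
// The bonus value (1) is an arbitrary assumption.
func score(c candidate) int {
	if len(c.FreqTags) > 0 {
		return 1
	}
	return 0
}

// best returns the highest-scoring candidate, keeping the earlier one on ties.
// It assumes cs is non-empty.
func best(cs []candidate) candidate {
	top := cs[0]
	for _, c := range cs[1:] {
		if score(c) > score(top) {
			top = c
		}
	}
	return top
}

func main() {
	readings := []candidate{
		{Reading: "そせい"},
		{Reading: "すじょう", FreqTags: []string{"news"}},
	}
	fmt.Println(best(readings).Reading) // prints すじょう
}
```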
stephenmk
6726c5245b
Rename variables for consistency 2023-01-23 14:09:50 -06:00
stephenmk
d8a3b420ee
Exclude "search" and "forms" terms from non-English dictionaries
This allows a user to install the English version and another version
without cluttering their setup with duplicated information.

If a user doesn't want to use the English version, they can get the
"search" and "forms" terms by installing the separate jmdict_forms
file.
2023-01-22 17:55:27 -06:00
stephenmk
8451803bfd
Update copyright 2023-01-22 15:00:13 -06:00
stephenmk
972dc6c4e9
Update dictionary build script 2023-01-22 14:40:39 -06:00
stephenmk
abc28bb19d
Add new JMdict version 2023-01-22 14:37:18 -06:00
stephenmk
73fb992865
Add intersection and union functions for string arrays 2023-01-22 14:32:45 -06:00
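
Set helpers of this kind might look like the sketch below. The function names, the order-preserving behavior, and the de-duplication are assumptions for illustration; the commit itself does not show the implementation.

```go
package main

import "fmt"

// union returns the distinct strings of a followed by those of b,
// in first-seen order.
func union(a, b []string) []string {
	seen := make(map[string]bool)
	var out []string
	for _, list := range [][]string{a, b} {
		for _, s := range list {
			if !seen[s] {
				seen[s] = true
				out = append(out, s)
			}
		}
	}
	return out
}

// intersection returns the distinct strings of a that also occur in b,
// in the order they appear in a.
func intersection(a, b []string) []string {
	inB := make(map[string]bool, len(b))
	for _, s := range b {
		inB[s] = true
	}
	seen := make(map[string]bool)
	var out []string
	for _, s := range a {
		if inB[s] && !seen[s] {
			seen[s] = true
			out = append(out, s)
		}
	}
	return out
}

func main() {
	a := []string{"news", "ichi", "spec"}
	b := []string{"spec", "gai", "news"}
	fmt.Println(union(a, b))        // [news ichi spec gai]
	fmt.Println(intersection(a, b)) // [news spec]
}
```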
stephenmk
56f9895967
Add struct for handling index.json data 2023-01-22 14:27:02 -06:00
stephenmk
853d0b33dc
Use empty interface type for dictionary glossaries
Necessary for structured content support
2023-01-22 14:14:33 -06:00
Alexei Yatskov
9222417bfd
Merge pull request #37 from toasted-nutbread/update-vs-rules
Update how suru verb rules are detected
2022-08-20 11:52:32 -07:00
toasted-nutbread
77d5d2debd Update how suru verb rules are detected 2022-08-14 15:35:20 -04:00
2168659243 Fix import path 2022-08-07 09:38:50 -07:00
Alexei Yatskov
b5d6095c06
Merge pull request #36 from 0x766F6964/update_daijisen
Update daijisen
2022-08-01 19:34:03 -07:00
Randy Palamar
5b8481e5bf remove duplicate newlines in definitions
this prevents entries from having empty lines, which are particularly
annoying when using the popup dictionary in yomichan
2022-07-28 20:38:02 -06:00
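
Collapsing repeated newlines can be done with a single regular expression. The sketch below is an assumption about the general approach, not the code from this commit; `collapseNewlines` is a hypothetical name.

```go
package main

import (
	"fmt"
	"regexp"
)

// repeatedNewlines matches any run of two or more consecutive newlines.
var repeatedNewlines = regexp.MustCompile(`\n{2,}`)

// collapseNewlines replaces each run of newlines with a single newline,
// so a definition no longer contains empty lines.
func collapseNewlines(s string) string {
	return repeatedNewlines.ReplaceAllString(s, "\n")
}

func main() {
	fmt.Printf("%q\n", collapseNewlines("first sense\n\n\nsecond sense"))
	// prints "first sense\nsecond sense"
}
```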
Randy Palamar
94326126d3 update the daijisen regexps
this also fixes #5

the method used is a bit hacky but it works
2022-07-28 20:27:29 -06:00
Randy Palamar
8bc7ffdb36 add newlines to characters indicating sub-definitions
this will cause some things to be displayed incorrectly but overall
makes daijisen much more readable.
2022-07-28 20:25:35 -06:00
Randy Palamar
65df67b085 map most of daijisen
the remaining glyphs don't exist in unicode, usually because they are
normally displayed using HTML or MathJax type things
2022-07-28 20:20:48 -06:00
Alexei Yatskov
57280ea5fd
Merge pull request #35 from univerio/shougakukan2
Add support for 小学館 中日・日中 統合辞書 第2版 EPWING
2022-07-14 21:18:53 -07:00
75207654d9 Update README 2022-07-14 14:24:32 -07:00
1fdf4f2998 Switch to foosoft.net for packages 2022-07-03 20:59:33 -07:00
Jack Zhou
c918a6bb5d Implement shougakukan2 2022-05-16 21:39:11 -07:00
Alex Yatskov
a4af996222
Merge pull request #31 from 0x766F6964/add_font_mappings
finish mapping most of daijirin
2022-02-05 18:23:22 -08:00
d61c1e0df6 Readme consistency 2022-02-05 18:22:07 -08:00
6b3aaf3886 Update readme 2022-02-05 18:20:31 -08:00
e16da37017 Update README 2021-12-15 18:06:35 -08:00
e9849380ea Add links 2021-12-14 20:32:29 -08:00
fc7fd48748 Add site metadata 2021-12-14 20:27:16 -08:00
Randy Palamar
6224b4c21f finish mapping most of daijirin
Now you can search for totally useful everyday words like 瘟㾮日
and 多羅吒干𤚥 :^).

The characters that remain either don't exist in unicode or are very
difficult to find. Also a couple terms seem unsearchable in qolibri so
I couldn't check what the characters are supposed to be.

Any questionable choice was marked with FIXME. This will make it easy to
replace some characters with their images if it's something that we want
to support in the future.

* The FIXMEs with the missing font symbol should all be the correct
  character (not commonly covered by fonts)

* The くの字点 choices are to try and imitate the daijirin
  experience(TM). Probably the worst use of image fonts I've seen. Those
  characters should never appear in horizontal text. They should have
  just been replaced with the text that was supposed to be repeated.

* The 漢文訓読 characters in '{}' are technically the unicode specified
  characters for those glyphs however they just look like their full
  size variants. I surrounded them with '{}' so the examples that use
  them are still readable.

* The other FIXMEs should be self explanatory. Search the term in qolibri
  and look at what they used to see why they are questionable.
2021-06-17 07:56:14 -06:00
35175a5a1e Update README 2021-06-08 21:02:14 -07:00
83e3e44f46 Update build scripts 2021-01-10 17:38:03 -08:00
e50a31cc05 Fix permissions 2021-01-10 17:30:07 -08:00
09f42f3345 Windows scripts 2021-01-10 17:28:27 -08:00
5bc88dcaee Update modules 2021-01-09 22:40:22 -08:00
ecc52f3155 Update to non-bugged version of zero-epwing-go 2021-01-09 22:39:18 -08:00
921d81bf0b Add modules 2021-01-09 15:53:39 -08:00
5f7728fe45 Update README 2021-01-01 21:05:20 -08:00
81d97dd032 Fix import of epwing dictionaries that have a lower-case CATALOGS file 2021-01-01 17:28:12 -08:00
1172bc8b84 Add script to build dictionaries 2021-01-01 17:18:22 -08:00
16d75c71e2 Update copyright 2021-01-01 16:28:06 -08:00
9c60afc1bb UI improvements 2021-01-01 16:24:58 -08:00
ea34cf9a37 Update copyrights, UI 2021-01-01 16:18:55 -08:00
795af5caa1 Refactor 2021-01-01 14:31:58 -08:00
4751a786b9 Fix import paths for zig 2021-01-01 11:46:11 -08:00
b66d908b23 Switch to zig for EPWING parsing 2020-12-31 21:53:10 -08:00
50901f7155 Remove zero-epwing related files 2020-12-31 20:42:43 -08:00
Alex Yatskov
d65c8c4f5d
Merge pull request #24 from toasted-nutbread/support-vz
Add term rules for zuru verbs
2020-12-09 19:45:44 -08:00
Alex Yatskov
1809f15f0b
Merge pull request #23 from toasted-nutbread/add-winmanifest
Add winmanifest import
2020-12-05 21:29:29 -08:00
toasted-nutbread
3cb4f0e386 Add term rules for zuru verbs 2020-12-05 23:48:34 -05:00