Current Path: > > bin
Operation : Linux premium131.web-hosting.com 4.18.0-553.44.1.lve.el8.x86_64 #1 SMP Thu Mar 13 14:29:12 UTC 2025 x86_64 Software : Apache Server IP : 162.0.232.56 | Your IP: 216.73.216.111 Domains : 1034 Domain(s) Permission : [ 0555 ]
Name | Type | Size | Last Modified | Actions |
---|---|---|---|---|
7za | File | 46 bytes | October 04 2024 01:50:29. | |
GET | File | 16217 bytes | April 22 2022 20:13:06. | |
File | 418704 bytes | October 13 2019 00:19:50. | ||
[ | File | 54960 bytes | April 01 2023 08:44:39. | |
aclocal | File | 36478 bytes | October 14 2023 20:00:43. | |
aclocal-1.16 | File | 36478 bytes | October 14 2023 20:00:43. | |
addr2line | File | 34216 bytes | November 15 2024 09:59:32. | |
animate | File | 12128 bytes | April 01 2025 13:15:58. | |
ar | File | 63448 bytes | November 15 2024 09:59:32. | |
arch | File | 38304 bytes | April 01 2023 08:44:39. | |
arpaname | File | 12104 bytes | February 20 2025 09:05:41. | |
as | File | 911264 bytes | November 15 2024 09:59:32. | |
aspell | File | 163328 bytes | April 18 2022 15:10:53. | |
at | File | 1036 bytes | October 10 2022 10:23:17. | |
atq | File | 1038 bytes | October 10 2022 10:23:17. | |
atrm | File | 1040 bytes | October 10 2022 10:23:17. | |
autoconf | File | 14768 bytes | August 13 2024 19:30:48. | |
autoheader | File | 8534 bytes | August 13 2024 19:30:48. | |
autom4te | File | 32181 bytes | August 13 2024 19:30:48. | |
automake | File | 257949 bytes | October 14 2023 20:00:43. | |
automake-1.16 | File | 257949 bytes | October 14 2023 20:00:43. | |
autoreconf | File | 21066 bytes | August 13 2024 19:30:48. | |
autoscan | File | 17124 bytes | August 13 2024 19:30:48. | |
autoupdate | File | 33872 bytes | August 13 2024 19:30:48. | |
awk | File | 685848 bytes | April 18 2022 15:56:05. | |
b2sum | File | 59144 bytes | April 01 2023 08:44:39. | |
base32 | File | 42544 bytes | April 01 2023 08:44:39. | |
base64 | File | 42560 bytes | April 01 2023 08:44:39. | |
basename | File | 38392 bytes | April 01 2023 08:44:39. | |
bash | File | 1150584 bytes | May 23 2024 08:55:42. | |
bashbug-64 | File | 7348 bytes | May 23 2024 08:55:35. | |
batch | File | 137 bytes | October 10 2022 10:23:17. | |
bison | File | 448224 bytes | October 12 2019 12:28:39. | |
bunzip2 | File | 37744 bytes | January 28 2025 01:38:59. | |
bzcat | File | 37744 bytes | January 28 2025 01:38:59. | |
bzcmp | File | 2128 bytes | January 28 2025 01:38:58. | |
bzdiff | File | 2128 bytes | January 28 2025 01:38:58. | |
bzgrep | File | 1677 bytes | January 28 2025 01:38:58. | |
bzip2 | File | 37744 bytes | January 28 2025 01:38:59. | |
bzip2recover | File | 16832 bytes | January 28 2025 01:38:59. | |
bzless | File | 1259 bytes | January 28 2025 01:38:58. | |
bzmore | File | 1259 bytes | January 28 2025 01:38:58. | |
c++ | File | 1270688 bytes | March 31 2025 10:38:33. | |
c++filt | File | 29584 bytes | November 15 2024 09:59:32. | |
c89 | File | 224 bytes | March 31 2025 10:36:31. | |
c99 | File | 215 bytes | March 31 2025 10:36:31. | |
cagefs_enter.proxied | File | 1056 bytes | May 26 2025 11:13:23. | |
cal | File | 67560 bytes | April 06 2024 13:02:53. | |
captoinfo | File | 87360 bytes | October 14 2023 18:54:49. | |
cat | File | 38440 bytes | April 01 2023 08:44:39. | |
catchsegv | File | 3283 bytes | June 09 2025 20:25:38. | |
cc | File | 1266552 bytes | March 31 2025 10:38:36. | |
chcon | File | 72120 bytes | April 01 2023 08:44:39. | |
chgrp | File | 67944 bytes | April 01 2023 08:44:39. | |
chmod | File | 63784 bytes | April 01 2023 08:44:39. | |
chown | File | 72080 bytes | April 01 2023 08:44:39. | |
chrt | File | 38072 bytes | April 06 2024 13:02:53. | |
cksum | File | 38360 bytes | April 01 2023 08:44:39. | |
cldetect | File | 10600 bytes | June 05 2025 09:53:15. | |
clear | File | 12840 bytes | October 14 2023 18:54:49. | |
clusterdb | File | 71920 bytes | February 27 2024 08:25:00. | |
cmp | File | 106248 bytes | May 04 2020 15:15:14. | |
col | File | 29696 bytes | April 06 2024 13:02:53. | |
colcrt | File | 16872 bytes | April 06 2024 13:02:53. | |
colrm | File | 25480 bytes | April 06 2024 13:02:53. | |
column | File | 50656 bytes | April 06 2024 13:02:53. | |
comm | File | 42632 bytes | April 01 2023 08:44:39. | |
compare | File | 12136 bytes | April 01 2025 13:15:58. | |
composite | File | 12120 bytes | April 01 2025 13:15:58. | |
conjure | File | 12120 bytes | April 01 2025 13:15:58. | |
convert | File | 12120 bytes | April 01 2025 13:15:58. | |
cp | File | 151600 bytes | April 01 2023 08:44:39. | |
cpan | File | 8370 bytes | June 03 2025 14:32:21. | |
cpp | File | 1266600 bytes | March 31 2025 10:38:33. | |
createdb | File | 71904 bytes | February 27 2024 08:25:00. | |
createuser | File | 76416 bytes | February 27 2024 08:25:00. | |
crontab | File | 1524 bytes | April 06 2024 11:40:05. | |
crontab.cagefs | File | 55464 bytes | June 23 2025 12:23:46. | |
csplit | File | 55048 bytes | April 01 2023 08:44:39. | |
curl | File | 235608 bytes | January 28 2025 01:43:10. | |
cut | File | 50784 bytes | April 01 2023 08:44:39. | |
date | File | 108576 bytes | April 01 2023 08:44:39. | |
dbiprof | File | 6206 bytes | April 25 2022 16:10:04. | |
dd | File | 79920 bytes | April 01 2023 08:44:39. | |
delv | File | 43480 bytes | February 20 2025 09:05:41. | |
df | File | 93352 bytes | April 01 2023 08:44:39. | |
diff | File | 274440 bytes | May 04 2020 15:15:14. | |
diff3 | File | 131688 bytes | May 04 2020 15:15:14. | |
dig | File | 166080 bytes | February 20 2025 09:05:41. | |
dir | File | 143328 bytes | April 01 2023 08:44:39. | |
dircolors | File | 50816 bytes | April 01 2023 08:44:39. | |
dirname | File | 34240 bytes | April 01 2023 08:44:39. | |
display | File | 12128 bytes | April 01 2025 13:15:58. | |
dnstap-read | File | 20920 bytes | February 20 2025 09:05:41. | |
dropdb | File | 67608 bytes | February 27 2024 08:25:00. | |
dropuser | File | 67576 bytes | February 27 2024 08:25:00. | |
du | File | 109672 bytes | April 01 2023 08:44:39. | |
easy_install-3 | File | bytes | January 01 1970 00:00:00. | |
echo | File | 38328 bytes | April 01 2023 08:44:39. | |
egrep | File | 28 bytes | October 11 2019 15:15:56. | |
enc2xs | File | 41958 bytes | October 13 2019 08:46:10. | |
enchant | File | 21584 bytes | October 23 2019 20:03:55. | |
enchant-lsmod | File | 13408 bytes | October 23 2019 20:03:55. | |
env | File | 42424 bytes | April 01 2023 08:44:39. | |
eps2eps | File | 639 bytes | June 03 2025 02:48:11. | |
eqn | File | 237728 bytes | October 13 2019 14:29:41. | |
ex | File | 1180368 bytes | August 02 2022 16:57:02. | |
expand | File | 42664 bytes | April 01 2023 08:44:39. | |
expr | File | 50840 bytes | April 01 2023 08:44:39. | |
factor | File | 88120 bytes | April 01 2023 08:44:39. | |
false | File | 34192 bytes | April 01 2023 08:44:39. | |
fc-cache | File | 132 bytes | August 16 2021 11:04:22. | |
fc-cache-64 | File | 20840 bytes | October 09 2021 09:02:09. | |
fc-cat | File | 16744 bytes | October 09 2021 09:02:09. | |
fc-conflist | File | 12544 bytes | October 09 2021 09:02:09. | |
fc-list | File | 12544 bytes | October 09 2021 09:02:09. | |
fc-match | File | 16648 bytes | October 09 2021 09:02:09. | |
fc-pattern | File | 12552 bytes | October 09 2021 09:02:09. | |
fc-query | File | 12536 bytes | October 09 2021 09:02:09. | |
fc-scan | File | 12552 bytes | October 09 2021 09:02:09. | |
fc-validate | File | 16648 bytes | October 09 2021 09:02:09. | |
fgrep | File | 28 bytes | October 11 2019 15:15:56. | |
file | File | 25272 bytes | April 06 2024 14:44:54. | |
find | File | 228664 bytes | September 24 2024 02:19:34. | |
flex | File | 438728 bytes | October 12 2019 12:33:17. | |
flex++ | File | 438728 bytes | October 12 2019 12:33:17. | |
flock | File | 33992 bytes | April 06 2024 13:02:53. | |
fmt | File | 46664 bytes | April 01 2023 08:44:39. | |
fold | File | 42480 bytes | April 01 2023 08:44:39. | |
free | File | 21288 bytes | October 14 2023 20:31:02. | |
freetype-config | File | 4416 bytes | March 31 2025 19:37:30. | |
funzip | File | 37504 bytes | June 03 2025 02:10:12. | |
g++ | File | 1270688 bytes | March 31 2025 10:38:33. | |
gawk | File | 685848 bytes | April 18 2022 15:56:05. | |
gcc | File | 1266552 bytes | March 31 2025 10:38:36. | |
gcc-ar | File | 37536 bytes | March 31 2025 10:38:33. | |
gcc-nm | File | 37536 bytes | March 31 2025 10:38:36. | |
gcc-ranlib | File | 37536 bytes | March 31 2025 10:38:36. | |
gcov | File | 1377552 bytes | March 31 2025 10:38:36. | |
gcov-dump | File | 584584 bytes | March 31 2025 10:38:36. | |
gcov-tool | File | 622336 bytes | March 31 2025 10:38:36. | |
gem | File | 542 bytes | May 14 2025 12:40:38. | |
gencat | File | 25432 bytes | June 09 2025 20:31:29. | |
geoiplookup | File | 22416 bytes | November 28 2019 22:06:01. | |
geoiplookup6 | File | 22168 bytes | November 28 2019 22:06:01. | |
geqn | File | 237728 bytes | October 13 2019 14:29:41. | |
getconf | File | 33240 bytes | June 09 2025 20:31:29. | |
getent | File | 33920 bytes | June 09 2025 20:31:29. | |
getopt | File | 21016 bytes | April 06 2024 13:02:53. | |
ghostscript | File | 12648 bytes | June 03 2025 02:48:22. | |
git | File | 3845928 bytes | July 23 2025 06:59:57. | |
git-receive-pack | File | 3845928 bytes | July 23 2025 06:59:57. | |
git-shell | File | 2236688 bytes | July 23 2025 06:59:57. | |
git-upload-archive | File | 3845928 bytes | July 23 2025 06:59:57. | |
git-upload-pack | File | 3845928 bytes | July 23 2025 06:59:57. | |
gm | File | 8008 bytes | March 28 2022 15:50:37. | |
gmake | File | 240968 bytes | April 18 2022 16:38:34. | |
gneqn | File | 908 bytes | October 13 2019 14:29:37. | |
gnroff | File | 3312 bytes | October 13 2019 14:29:39. | |
gpg | File | 1090528 bytes | September 13 2022 10:15:05. | |
gpg-agent | File | 429352 bytes | September 13 2022 10:15:05. | |
gpg-error | File | 34976 bytes | October 12 2019 12:20:46. | |
gpg-zip | File | 3525 bytes | September 13 2022 10:15:00. | |
gpgsplit | File | 89104 bytes | September 13 2022 10:15:05. | |
gpgv | File | 462416 bytes | September 13 2022 10:15:05. | |
gpic | File | 300896 bytes | October 13 2019 14:29:41. | |
gprof | File | 105840 bytes | November 15 2024 09:59:32. | |
grep | File | 198280 bytes | October 11 2019 15:15:57. | |
groff | File | 127920 bytes | October 13 2019 14:29:41. | |
grops | File | 195728 bytes | October 13 2019 14:29:41. | |
grotty | File | 145304 bytes | October 13 2019 14:29:41. | |
groups | File | 38368 bytes | April 01 2023 08:44:39. | |
gs | File | 12648 bytes | June 03 2025 02:48:22. | |
gsnd | File | 277 bytes | June 03 2025 02:48:11. | |
gsoelim | File | 43576 bytes | October 13 2019 14:29:41. | |
gtar | File | 459768 bytes | June 11 2025 14:11:24. | |
gtbl | File | 158320 bytes | October 13 2019 14:29:41. | |
gtroff | File | 824344 bytes | October 13 2019 14:29:41. | |
gunzip | File | 2345 bytes | April 27 2022 05:49:28. | |
gzexe | File | 6375 bytes | April 27 2022 05:49:28. | |
gzip | File | 96944 bytes | April 27 2022 05:49:29. | |
h2ph | File | 29382 bytes | July 28 2025 08:08:32. | |
h2xs | File | 60866 bytes | July 28 2025 08:07:09. | |
head | File | 46672 bytes | April 01 2023 08:44:39. | |
hexdump | File | 58880 bytes | April 06 2024 13:02:53. | |
host | File | 145712 bytes | February 20 2025 09:05:41. | |
hostid | File | 34208 bytes | April 01 2023 08:44:39. | |
hostname | File | 21664 bytes | October 11 2019 13:06:51. | |
hunspell | File | 148168 bytes | October 13 2019 08:33:15. | |
iconv | File | 62912 bytes | June 09 2025 20:31:29. | |
id | File | 46608 bytes | April 01 2023 08:44:39. | |
identify | File | 12128 bytes | April 01 2025 13:15:58. | |
idn | File | 40352 bytes | October 13 2019 16:55:35. | |
ifnames | File | 4128 bytes | August 13 2024 19:30:48. | |
import | File | 12120 bytes | April 01 2025 13:15:58. | |
infocmp | File | 62512 bytes | October 14 2023 18:54:49. | |
infotocap | File | 87360 bytes | October 14 2023 18:54:49. | |
install | File | 160000 bytes | April 01 2023 08:44:39. | |
instmodsh | File | 4194 bytes | October 13 2019 08:55:21. | |
ionice | File | 29680 bytes | April 06 2024 13:02:53. | |
ipcrm | File | 29688 bytes | April 06 2024 13:02:53. | |
ipcs | File | 54672 bytes | April 06 2024 13:02:53. | |
isosize | File | 25472 bytes | April 06 2024 13:02:53. | |
ispell | File | 988 bytes | April 18 2022 15:10:51. | |
join | File | 55056 bytes | April 01 2023 08:44:39. | |
kill | File | 38168 bytes | April 06 2024 13:02:53. | |
ld | File | 1789096 bytes | November 15 2024 09:59:32. | |
ld.bfd | File | 1789096 bytes | November 15 2024 09:59:32. | |
ldd | File | 5441 bytes | June 09 2025 20:25:41. | |
less | File | 177928 bytes | July 02 2024 20:10:37. | |
lessecho | File | 12696 bytes | July 02 2024 20:10:37. | |
lesskey | File | 22520 bytes | July 02 2024 20:10:37. | |
lesspipe.sh | File | 3143 bytes | July 02 2024 19:57:10. | |
lex | File | 438728 bytes | October 12 2019 12:33:17. | |
libnetcfg | File | 15775 bytes | July 28 2025 08:08:32. | |
libtool | File | 367724 bytes | October 11 2019 14:55:28. | |
libtoolize | File | 129197 bytes | October 11 2019 14:55:28. | |
link | File | 34208 bytes | April 01 2023 08:44:39. | |
ln | File | 72264 bytes | April 01 2023 08:44:39. | |
locale | File | 57800 bytes | June 09 2025 20:31:29. | |
localedef | File | 314848 bytes | June 09 2025 20:31:29. | |
logger | File | 51184 bytes | April 06 2024 13:02:53. | |
login | File | 41944 bytes | April 06 2024 13:02:53. | |
logname | File | 34224 bytes | April 01 2023 08:44:39. | |
look | File | 16848 bytes | April 06 2024 13:02:53. | |
ls | File | 143328 bytes | April 01 2023 08:44:39. | |
lynx | File | 1928720 bytes | April 18 2022 21:01:05. | |
m4 | File | 190016 bytes | October 11 2019 14:41:40. | |
File | 418704 bytes | October 13 2019 00:19:50. | ||
mailx | File | 418704 bytes | October 13 2019 00:19:50. | |
make | File | 240968 bytes | April 18 2022 16:38:34. | |
make-dummy-cert | File | 610 bytes | October 09 2024 13:39:22. | |
mariadb | File | 4545824 bytes | May 28 2025 17:32:49. | |
mariadb-access | File | 111961 bytes | May 28 2025 17:32:49. | |
mariadb-admin | File | 4070888 bytes | May 28 2025 17:32:49. | |
mariadb-binlog | File | 4346304 bytes | May 28 2025 17:32:49. | |
mariadb-check | File | 4063880 bytes | May 28 2025 17:32:49. | |
mariadb-conv | File | 3764080 bytes | May 28 2025 17:32:49. | |
mariadb-convert-table-format | File | 4221 bytes | May 28 2025 17:32:49. | |
mariadb-dump | File | 4155560 bytes | May 28 2025 17:32:49. | |
mariadb-dumpslow | File | 8242 bytes | May 28 2025 17:32:49. | |
mariadb-embedded | File | 23688240 bytes | May 28 2025 17:32:49. | |
mariadb-find-rows | File | 3290 bytes | May 28 2025 17:32:49. | |
mariadb-hotcopy | File | 34973 bytes | May 28 2025 17:32:49. | |
mariadb-import | File | 4058696 bytes | May 28 2025 17:32:49. | |
mariadb-plugin | File | 3742920 bytes | May 28 2025 17:32:49. | |
mariadb-secure-installation | File | 13810 bytes | May 28 2025 17:32:49. | |
mariadb-setpermission | File | 17977 bytes | May 28 2025 17:32:49. | |
mariadb-show | File | 4053024 bytes | May 28 2025 17:32:49. | |
mariadb-slap | File | 4076624 bytes | May 28 2025 17:32:49. | |
mariadb-tzinfo-to-sql | File | 3738248 bytes | May 28 2025 17:32:49. | |
mariadb-waitpid | File | 3725568 bytes | May 28 2025 17:32:49. | |
mc | File | 1360320 bytes | October 18 2019 20:38:40. | |
mcdiff | File | 1360320 bytes | October 18 2019 20:38:40. | |
mcedit | File | 1360320 bytes | October 18 2019 20:38:40. | |
mcookie | File | 34056 bytes | April 06 2024 13:02:53. | |
mcview | File | 1360320 bytes | October 18 2019 20:38:40. | |
md5sum | File | 46712 bytes | April 01 2023 08:44:39. | |
mesg | File | 16752 bytes | April 06 2024 13:02:53. | |
mkdir | File | 84776 bytes | April 01 2023 08:44:39. | |
mkfifo | File | 68160 bytes | April 01 2023 08:44:39. | |
mknod | File | 72248 bytes | April 01 2023 08:44:39. | |
mktemp | File | 46832 bytes | April 01 2023 08:44:39. | |
mogrify | File | 12120 bytes | April 01 2025 13:15:58. | |
montage | File | 12120 bytes | April 01 2025 13:15:58. | |
more | File | 46016 bytes | April 06 2024 13:02:53. | |
msql2mysql | File | 1446 bytes | May 28 2025 17:32:49. | |
mv | File | 147488 bytes | April 01 2023 08:44:39. | |
my_print_defaults | File | 3730496 bytes | May 28 2025 17:32:49. | |
mysql | File | 4545824 bytes | May 28 2025 17:32:49. | |
mysql_config | File | 4622 bytes | May 28 2025 17:32:51. | |
mysql_find_rows | File | 3290 bytes | May 28 2025 17:32:49. | |
mysql_waitpid | File | 3725568 bytes | May 28 2025 17:32:49. | |
mysqlaccess | File | 111961 bytes | May 28 2025 17:32:49. | |
mysqladmin | File | 4070888 bytes | May 28 2025 17:32:49. | |
mysqlbinlog | File | 4346304 bytes | May 28 2025 17:32:49. | |
mysqlcheck | File | 4063880 bytes | May 28 2025 17:32:49. | |
mysqldump | File | 4155560 bytes | May 28 2025 17:32:49. | |
mysqlimport | File | 4058696 bytes | May 28 2025 17:32:49. | |
mysqlshow | File | 4053024 bytes | May 28 2025 17:32:49. | |
mytop | File | 73681 bytes | May 28 2025 17:32:49. | |
namei | File | 33896 bytes | April 06 2024 13:02:53. | |
nano | File | 253888 bytes | September 24 2024 02:16:19. | |
neqn | File | 908 bytes | October 13 2019 14:29:37. | |
nice | File | 38312 bytes | April 01 2023 08:44:39. | |
nl | File | 46720 bytes | April 01 2023 08:44:39. | |
nm | File | 51584 bytes | November 15 2024 09:59:32. | |
nohup | File | 38384 bytes | April 01 2023 08:44:39. | |
nproc | File | 38384 bytes | April 01 2023 08:44:39. | |
nroff | File | 3312 bytes | October 13 2019 14:29:39. | |
nslookup | File | 149768 bytes | February 20 2025 09:05:41. | |
nsupdate | File | 74808 bytes | February 20 2025 09:05:41. | |
numfmt | File | 67288 bytes | April 01 2023 08:44:39. | |
objcopy | File | 245832 bytes | November 15 2024 09:59:32. | |
objdump | File | 429832 bytes | November 15 2024 09:59:32. | |
od | File | 75656 bytes | April 01 2023 08:44:39. | |
openssl | File | 763856 bytes | October 09 2024 13:39:36. | |
pango-list | File | 12160 bytes | October 08 2021 15:22:06. | |
pango-view | File | 58816 bytes | October 08 2021 15:22:06. | |
passwd | File | 1044 bytes | April 18 2022 22:59:33. | |
paste | File | 38360 bytes | April 01 2023 08:44:39. | |
patch | File | 211416 bytes | June 01 2020 15:14:25. | |
pathchk | File | 38304 bytes | April 01 2023 08:44:39. | |
pdf2dsc | File | 698 bytes | June 03 2025 02:48:11. | |
pdf2ps | File | 909 bytes | June 03 2025 02:48:11. | |
perl | File | 12728 bytes | July 28 2025 08:07:20. | |
perl5.26.3 | File | 12728 bytes | July 28 2025 08:07:20. | |
perlbug | File | 45458 bytes | July 28 2025 08:08:32. | |
perldoc | File | 118 bytes | October 13 2019 11:53:57. | |
perlivp | File | 10813 bytes | July 28 2025 08:07:09. | |
perlml | File | 7024 bytes | August 10 2022 20:54:28. | |
perlthanks | File | 45458 bytes | July 28 2025 08:08:32. | |
pg_dump | File | 409016 bytes | February 27 2024 08:25:00. | |
pg_dumpall | File | 109680 bytes | February 27 2024 08:25:00. | |
pg_restore | File | 177504 bytes | February 27 2024 08:25:00. | |
pgrep | File | 29536 bytes | October 14 2023 20:31:02. | |
php | File | 937 bytes | April 03 2025 11:08:35. | |
pic | File | 300896 bytes | October 13 2019 14:29:41. | |
piconv | File | 8271 bytes | October 13 2019 08:46:10. | |
pinentry | File | 2404 bytes | August 12 2018 17:18:10. | |
pinentry-curses | File | 79760 bytes | November 24 2019 17:39:34. | |
ping | File | 67712 bytes | October 14 2023 17:19:01. | |
pinky | File | 42528 bytes | April 01 2023 08:44:39. | |
pip-3 | File | bytes | January 01 1970 00:00:00. | |
pip3 | File | bytes | January 01 1970 00:00:00. | |
pkg-config | File | 41000 bytes | October 13 2019 06:57:31. | |
pkgconf | File | 41000 bytes | October 13 2019 06:57:31. | |
pkill | File | 29536 bytes | October 14 2023 20:31:02. | |
pl2pm | File | 4533 bytes | July 28 2025 08:08:16. | |
pmap | File | 33568 bytes | October 14 2023 20:31:02. | |
pod2html | File | 4134 bytes | July 28 2025 08:08:32. | |
pod2man | File | 15034 bytes | October 13 2019 14:12:32. | |
pod2text | File | 10803 bytes | October 13 2019 14:12:32. | |
pod2usage | File | 3948 bytes | October 13 2019 11:57:18. | |
podchecker | File | 3658 bytes | October 13 2019 11:46:35. | |
podselect | File | 2527 bytes | October 13 2019 11:52:55. | |
post-grohtml | File | 244456 bytes | October 13 2019 14:29:41. | |
pr | File | 84200 bytes | April 01 2023 08:44:39. | |
pre-grohtml | File | 133688 bytes | October 13 2019 14:29:41. | |
precat | File | 5656 bytes | April 18 2022 15:10:50. | |
preunzip | File | 5656 bytes | April 18 2022 15:10:50. | |
prezip | File | 5656 bytes | April 18 2022 15:10:50. | |
prezip-bin | File | 12264 bytes | April 18 2022 15:10:53. | |
printenv | File | 34200 bytes | April 01 2023 08:44:39. | |
printf | File | 54928 bytes | April 01 2023 08:44:39. | |
prove | File | 13562 bytes | October 13 2019 12:42:34. | |
ps | File | 137984 bytes | October 14 2023 20:31:02. | |
ps2ascii | File | 631 bytes | June 03 2025 02:48:11. | |
ps2epsi | File | 2752 bytes | June 03 2025 02:48:11. | |
ps2pdf | File | 272 bytes | June 03 2025 02:48:11. | |
ps2pdf12 | File | 215 bytes | June 03 2025 02:48:11. | |
ps2pdf13 | File | 215 bytes | June 03 2025 02:48:11. | |
ps2pdf14 | File | 215 bytes | June 03 2025 02:48:11. | |
ps2pdfwr | File | 1097 bytes | June 03 2025 02:48:11. | |
ps2ps | File | 647 bytes | June 03 2025 02:48:11. | |
ps2ps2 | File | 669 bytes | June 03 2025 02:48:11. | |
psql | File | 659792 bytes | February 27 2024 08:25:00. | |
ptx | File | 79944 bytes | April 01 2023 08:44:39. | |
pwd | File | 38400 bytes | April 01 2023 08:44:39. | |
pwdx | File | 12984 bytes | October 14 2023 20:31:02. | |
pydoc-3 | File | bytes | January 01 1970 00:00:00. | |
pydoc3 | File | bytes | January 01 1970 00:00:00. | |
python2 | File | 8032 bytes | April 10 2024 04:58:50. | |
python2.7 | File | 8032 bytes | April 10 2024 04:58:50. | |
python3 | File | 11872 bytes | July 01 2025 22:14:50. | |
python3.6 | File | 11872 bytes | July 01 2025 22:14:50. | |
python3.6m | File | 11872 bytes | July 01 2025 22:14:50. | |
pyvenv-3 | File | bytes | January 01 1970 00:00:00. | |
ranlib | File | 63464 bytes | November 15 2024 09:59:32. | |
raw | File | 16888 bytes | April 06 2024 13:02:53. | |
readelf | File | 639528 bytes | November 15 2024 09:59:32. | |
readlink | File | 47064 bytes | April 01 2023 08:44:39. | |
realpath | File | 51216 bytes | April 01 2023 08:44:39. | |
recode | File | 48160 bytes | October 18 2019 15:18:02. | |
reindexdb | File | 72008 bytes | February 27 2024 08:25:00. | |
rename | File | 16896 bytes | April 06 2024 13:02:53. | |
renew-dummy-cert | File | 725 bytes | October 09 2024 13:39:22. | |
renice | File | 16856 bytes | April 06 2024 13:02:53. | |
replace | File | 3708040 bytes | May 28 2025 17:32:49. | |
reset | File | 25352 bytes | October 14 2023 18:54:49. | |
rev | File | 12752 bytes | April 06 2024 13:02:53. | |
rm | File | 72160 bytes | April 01 2023 08:44:39. | |
rmdir | File | 46632 bytes | April 01 2023 08:44:39. | |
rnano | File | 253888 bytes | September 24 2024 02:16:19. | |
rsync | File | 522392 bytes | June 03 2025 02:02:49. | |
ruby | File | 12128 bytes | May 14 2025 12:43:45. | |
run-with-aspell | File | 85 bytes | April 18 2022 15:10:50. | |
runcon | File | 38344 bytes | April 01 2023 08:44:39. | |
rvi | File | 1180368 bytes | August 02 2022 16:57:02. | |
rview | File | 1180368 bytes | August 02 2022 16:57:02. | |
rvim | File | 3067760 bytes | August 02 2022 16:57:02. | |
scalar | File | 2290912 bytes | July 23 2025 06:59:57. | |
scl | File | 37752 bytes | April 01 2023 16:10:31. | |
scl_enabled | File | 258 bytes | August 25 2017 08:23:02. | |
scl_source | File | 1863 bytes | April 01 2023 16:10:31. | |
scp | File | 105320 bytes | February 19 2025 20:59:40. | |
screen | File | 494040 bytes | May 05 2021 10:04:06. | |
script | File | 37672 bytes | April 06 2024 13:02:53. | |
sdiff | File | 107856 bytes | May 04 2020 15:15:14. | |
sed | File | 118248 bytes | April 18 2022 21:41:04. | |
selectorctl | File | 7812 bytes | July 11 2025 17:46:23. | |
seq | File | 54808 bytes | April 01 2023 08:44:39. | |
setsid | File | 16768 bytes | April 06 2024 13:02:53. | |
setterm | File | 46200 bytes | April 06 2024 13:02:53. | |
sftp | File | 163576 bytes | February 19 2025 20:59:40. | |
sh | File | 1150584 bytes | May 23 2024 08:55:42. | |
sha1sum | File | 46720 bytes | April 01 2023 08:44:39. | |
sha224sum | File | 46752 bytes | April 01 2023 08:44:39. | |
sha256sum | File | 46752 bytes | April 01 2023 08:44:39. | |
sha384sum | File | 46760 bytes | April 01 2023 08:44:39. | |
sha512sum | File | 46760 bytes | April 01 2023 08:44:39. | |
shred | File | 63424 bytes | April 01 2023 08:44:39. | |
shuf | File | 59560 bytes | April 01 2023 08:44:39. | |
size | File | 34048 bytes | November 15 2024 09:59:32. | |
skill | File | 29488 bytes | October 14 2023 20:31:02. | |
slabtop | File | 21344 bytes | October 14 2023 20:31:02. | |
sleep | File | 38368 bytes | April 01 2023 08:44:39. | |
snice | File | 29488 bytes | October 14 2023 20:31:02. | |
soelim | File | 43576 bytes | October 13 2019 14:29:41. | |
sort | File | 126512 bytes | April 01 2023 08:44:39. | |
spell | File | 122 bytes | April 18 2022 15:10:51. | |
splain | File | 19150 bytes | July 28 2025 08:08:32. | |
split | File | 59520 bytes | April 01 2023 08:44:39. | |
sprof | File | 29360 bytes | June 09 2025 20:31:29. | |
sqlite3 | File | 1346512 bytes | July 29 2025 01:27:56. | |
ssh | File | 775720 bytes | February 19 2025 20:59:40. | |
ssh-add | File | 354432 bytes | February 19 2025 20:59:40. | |
ssh-agent | File | 333392 bytes | February 19 2025 20:59:40. | |
ssh-copy-id | File | 10694 bytes | February 19 2025 20:59:39. | |
ssh-keygen | File | 437416 bytes | February 19 2025 20:59:40. | |
ssh-keyscan | File | 438856 bytes | February 19 2025 20:59:40. | |
stat | File | 88304 bytes | April 01 2023 08:44:39. | |
stdbuf | File | 50768 bytes | April 01 2023 08:44:39. | |
strace | File | 2029832 bytes | January 28 2025 01:32:17. | |
stream | File | 12112 bytes | April 01 2025 13:15:58. | |
strings | File | 38328 bytes | November 15 2024 09:59:32. | |
strip | File | 245856 bytes | November 15 2024 09:59:32. | |
stty | File | 79544 bytes | April 01 2023 08:44:39. | |
sum | File | 46704 bytes | April 01 2023 08:44:39. | |
sync | File | 38328 bytes | April 01 2023 08:44:39. | |
tabs | File | 16952 bytes | October 14 2023 18:54:49. | |
tac | File | 42568 bytes | April 01 2023 08:44:39. | |
tail | File | 75984 bytes | April 01 2023 08:44:39. | |
tar | File | 459768 bytes | June 11 2025 14:11:24. | |
taskset | File | 38144 bytes | April 06 2024 13:02:53. | |
tbl | File | 158320 bytes | October 13 2019 14:29:41. | |
tclsh | File | 9256 bytes | October 12 2019 00:25:32. | |
tclsh8.6 | File | 9256 bytes | October 12 2019 00:25:32. | |
tee | File | 42552 bytes | April 01 2023 08:44:39. | |
test | File | 54912 bytes | April 01 2023 08:44:39. | |
tic | File | 87360 bytes | October 14 2023 18:54:49. | |
timeout | File | 42936 bytes | April 01 2023 08:44:39. | |
tload | File | 17160 bytes | October 14 2023 20:31:02. | |
tmpwatch | File | 36320 bytes | October 12 2019 11:32:29. | |
toe | File | 16848 bytes | October 14 2023 18:54:49. | |
top | File | 124616 bytes | October 14 2023 20:31:02. | |
touch | File | 96272 bytes | April 01 2023 08:44:39. | |
tput | File | 25392 bytes | October 14 2023 18:54:49. | |
tr | File | 50896 bytes | April 01 2023 08:44:39. | |
traceroute | File | 72672 bytes | March 11 2025 07:45:53. | |
troff | File | 824344 bytes | October 13 2019 14:29:41. | |
true | File | 34200 bytes | April 01 2023 08:44:39. | |
truncate | File | 42432 bytes | April 01 2023 08:44:39. | |
tset | File | 25352 bytes | October 14 2023 18:54:49. | |
tsort | File | 42568 bytes | April 01 2023 08:44:39. | |
tty | File | 34192 bytes | April 01 2023 08:44:39. | |
tzselect | File | 15370 bytes | June 09 2025 20:25:25. | |
uapi | File | 1040 bytes | June 28 2025 23:07:04. | |
ul | File | 21072 bytes | April 06 2024 13:02:53. | |
uname | File | 38304 bytes | April 01 2023 08:44:39. | |
unexpand | File | 46776 bytes | April 01 2023 08:44:39. | |
uniq | File | 50912 bytes | April 01 2023 08:44:39. | |
unlink | File | 34216 bytes | April 01 2023 08:44:39. | |
unversioned-python | File | bytes | January 01 1970 00:00:00. | |
unzip | File | 206728 bytes | June 03 2025 02:10:12. | |
unzipsfx | File | 103912 bytes | June 03 2025 02:10:12. | |
uptime | File | 12888 bytes | October 14 2023 20:31:02. | |
users | File | 38368 bytes | April 01 2023 08:44:39. | |
utmpdump | File | 29344 bytes | April 06 2024 13:02:53. | |
vacuumdb | File | 80344 bytes | February 27 2024 08:25:00. | |
vdir | File | 143328 bytes | April 01 2023 08:44:39. | |
vi | File | 1180368 bytes | August 02 2022 16:57:02. | |
view | File | 1180368 bytes | August 02 2022 16:57:02. | |
vim | File | 3067760 bytes | August 02 2022 16:57:02. | |
vimdiff | File | 3067760 bytes | August 02 2022 16:57:02. | |
vimtutor | File | 2121 bytes | August 02 2022 16:56:59. | |
vmstat | File | 37672 bytes | October 14 2023 20:31:02. | |
watch | File | 29888 bytes | October 14 2023 20:31:02. | |
wc | File | 50912 bytes | April 01 2023 08:44:39. | |
wget | File | 533928 bytes | August 13 2024 22:22:56. | |
whereis | File | 29976 bytes | April 06 2024 13:02:53. | |
which | File | 30144 bytes | October 15 2023 02:50:46. | |
who | File | 54968 bytes | April 01 2023 08:44:39. | |
whoami | File | 34216 bytes | April 01 2023 08:44:39. | |
word-list-compress | File | 12280 bytes | April 18 2022 15:10:53. | |
x86_64-redhat-linux-c++ | File | 1270688 bytes | March 31 2025 10:38:33. | |
x86_64-redhat-linux-g++ | File | 1270688 bytes | March 31 2025 10:38:33. | |
x86_64-redhat-linux-gcc | File | 1266552 bytes | March 31 2025 10:38:36. | |
x86_64-redhat-linux-gcc-8 | File | 1266552 bytes | March 31 2025 10:38:36. | |
xargs | File | 75888 bytes | September 24 2024 02:19:34. | |
xmlcatalog | File | 20864 bytes | July 10 2025 07:02:26. | |
xmllint | File | 75128 bytes | July 10 2025 07:02:26. | |
xmlwf | File | 33752 bytes | April 15 2025 19:58:56. | |
xsltproc | File | 29152 bytes | June 09 2025 15:03:50. | |
xsubpp | File | 5080 bytes | October 13 2019 08:58:35. | |
xxd | File | 21016 bytes | August 02 2022 16:57:02. | |
yes | File | 34248 bytes | April 01 2023 08:44:39. | |
zcat | File | 1983 bytes | April 27 2022 05:49:28. | |
zcmp | File | 1677 bytes | April 27 2022 05:49:28. | |
zdiff | File | 5879 bytes | April 27 2022 05:49:28. | |
zegrep | File | 29 bytes | April 27 2022 05:49:28. | |
zfgrep | File | 29 bytes | April 27 2022 05:49:28. | |
zforce | File | 2080 bytes | April 27 2022 05:49:28. | |
zgrep | File | 7582 bytes | April 27 2022 05:49:28. | |
zip | File | 234496 bytes | October 11 2019 13:11:04. | |
zipcloak | File | 105376 bytes | October 11 2019 13:11:04. | |
zipgrep | File | 2953 bytes | October 10 2008 17:40:36. | |
zipinfo | File | 206728 bytes | June 03 2025 02:10:12. | |
zipnote | File | 100104 bytes | October 11 2019 13:11:04. | |
zipsplit | File | 100104 bytes | October 11 2019 13:11:04. | |
zless | File | 2205 bytes | April 27 2022 05:49:28. | |
zmore | File | 1841 bytes | April 27 2022 05:49:28. | |
znew | File | 4552 bytes | April 27 2022 05:49:28. | |
zsoelim | File | 43576 bytes | October 13 2019 14:29:41. |
#!/usr/bin/perl BEGIN { # @INC poking no longer needed w/ new MakeMaker and Makefile.PL's # with $ENV{PERL_CORE} set # In case we need it in future... require Config; import Config; pop @INC if $INC[-1] eq '.'; } use strict; use warnings; use Getopt::Std; use Config; my @orig_ARGV = @ARGV; our $VERSION = do { my @r = (q$Revision: 2.21 $ =~ /\d+/g); sprintf "%d."."%02d" x $#r, @r }; # These may get re-ordered. # RAW is a do_now as inserted by &enter # AGG is an aggregated do_now, as built up by &process use constant { RAW_NEXT => 0, RAW_IN_LEN => 1, RAW_OUT_BYTES => 2, RAW_FALLBACK => 3, AGG_MIN_IN => 0, AGG_MAX_IN => 1, AGG_OUT_BYTES => 2, AGG_NEXT => 3, AGG_IN_LEN => 4, AGG_OUT_LEN => 5, AGG_FALLBACK => 6, }; # (See the algorithm in encengine.c - we're building structures for it) # There are two sorts of structures. # "do_now" (an array, two variants of what needs storing) is whatever we need # to do now we've read an input byte. # It's housed in a "do_next" (which is how we got to it), and in turn points # to a "do_next" which contains all the "do_now"s for the next input byte. # There will be a "do_next" which is the start state. # For a single byte encoding it's the only "do_next" - each "do_now" points # back to it, and each "do_now" will cause bytes. There is no state. # For a multi-byte encoding where all characters in the input are the same # length, then there will be a tree of "do_now"->"do_next"->"do_now" # branching out from the start state, one step for each input byte. # The leaf "do_now"s will all be at the same distance from the start state, # only the leaf "do_now"s cause output bytes, and they in turn point back to # the start state. # For an encoding where there are variable length input byte sequences, you # will encounter a leaf "do_now" sooner for the shorter input sequences, but # as before the leaves will point back to the start state. # The system will cope with escape encodings (imagine them as a mostly # self-contained tree for each escape state, and cross links between trees # at the state-switching characters) but so far no input format defines these. # The system will also cope with having output "leaves" in the middle of # the bifurcating branches, not just at the extremities, but again no # input format does this yet. # There are two variants of the "do_now" structure. The first, smaller variant # is generated by &enter as the input file is read. There is one structure # for each input byte. Say we are mapping a single byte encoding to a # single byte encoding, with "ABCD" going "abcd". There will be # 4 "do_now"s, {"A" => [...,"a",...], "B" => [...,"b",...], "C"=>..., "D"=>...} # &process then walks the tree, building aggregate "do_now" structures for # adjacent bytes where possible. The aggregate is for a contiguous range of # bytes which each produce the same length of output, each move to the # same next state, and each have the same fallback flag. # So our 4 RAW "do_now"s above become replaced by a single structure # containing: # ["A", "D", "abcd", 1, ...] # ie, for an input byte $_ in "A".."D", output 1 byte, found as # substr ("abcd", (ord $_ - ord "A") * 1, 1) # which maps very nicely into pointer arithmetic in C for encengine.c sub encode_U { # UTF-8 encode long hand - only covers part of perl's range ## my $uv = shift; # chr() works in native space so convert value from table # into that space before using chr(). my $ch = chr(utf8::unicode_to_native($_[0])); # Now get core perl to encode that the way it likes. utf8::encode($ch); return $ch; } sub encode_S { # encode single byte ## my ($ch,$page) = @_; return chr($ch); return chr $_[0]; } sub encode_D { # encode double byte MS byte first ## my ($ch,$page) = @_; return chr($page).chr($ch); return chr ($_[1]) . chr $_[0]; } sub encode_M { # encode Multi-byte - single for 0..255 otherwise double ## my ($ch,$page) = @_; ## return &encode_D if $page; ## return &encode_S; return chr ($_[1]) . chr $_[0] if $_[1]; return chr $_[0]; } my %encode_types = (U => \&encode_U, S => \&encode_S, D => \&encode_D, M => \&encode_M, ); # Win32 does not expand globs on command line if ($^O eq 'MSWin32' and !$ENV{PERL_CORE}) { eval "\@ARGV = map(glob(\$_),\@ARGV)"; @ARGV = @orig_ARGV unless @ARGV; } my %opt; # I think these are: # -Q to disable the duplicate codepoint test # -S make mapping errors fatal # -q to remove comments written to output files # -O to enable the (brute force) substring optimiser # -o <output> to specify the output file name (else it's the first arg) # -f <inlist> to give a file with a list of input files (else use the args) # -n <name> to name the encoding (else use the basename of the input file. #Getopt::Long::Configure("bundling"); #GetOptions(\%opt, qw(C M=s S Q q O o=s f=s n=s v)); getopts('CM:SQqOo:f:n:v',\%opt); $opt{M} and make_makefile_pl($opt{M}, @ARGV); $opt{C} and make_configlocal_pm($opt{C}, @ARGV); $opt{v} ||= $ENV{ENC2XS_VERBOSE}; sub verbose { print STDERR @_ if $opt{v}; } sub verbosef { printf STDERR @_ if $opt{v}; } # ($cpp, $static, $sized) = compiler_info($declaration) # # return some information about the compiler and compile options we're using: # # $declaration - true if we're doing a declaration rather than a definition. # # $cpp - we're using C++ # $static - ok to declare the arrays as static # $sized - the array declarations should be sized sub compiler_info { my ($declaration) = @_; my $ccflags = $Config{ccflags}; if (defined $Config{ccwarnflags}) { $ccflags .= " " . $Config{ccwarnflags}; } my $compat = $ccflags =~ /\Q-Wc++-compat/; my $pedantic = $ccflags =~ /-pedantic/; my $cpp = ($Config{d_cplusplus} || '') eq 'define'; # The encpage_t tables contain recursive and mutually recursive # references. To allow them to compile under C++ and some restrictive # cc options, it may be necessary to make the tables non-static/const # (thus moving them from the text to the data segment) and/or not # include the size in the declaration. my $static = !( $cpp || ($compat && $pedantic) || ($^O eq 'MacOS' && $declaration) ); # -Wc++-compat on its own warns if the array declaration is sized. # The easiest way to avoid this warning is simply not to include # the size in the declaration. # With -pedantic as well, the issue doesn't arise because $static # above becomes false. my $sized = $declaration && !($compat && !$pedantic); return ($cpp, $static, $sized); } # This really should go first, else the die here causes empty (non-erroneous) # output files to be written. my @encfiles; if (exists $opt{f}) { # -F is followed by name of file containing list of filenames my $flist = $opt{f}; open(FLIST,$flist) || die "Cannot open $flist:$!"; chomp(@encfiles = <FLIST>); close(FLIST); } else { @encfiles = @ARGV; } my $cname = $opt{o} ? $opt{o} : shift(@ARGV); unless ($cname) { #debuging a win32 nmake error-only. works via cmdline print "\nARGV:"; print "$_ " for @ARGV; print "\nopt:"; print " $_ => ",defined $opt{$_}?$opt{$_}:"undef","\n" for keys %opt; } chmod(0666,$cname) if -f $cname && !-w $cname; open(C,">", $cname) || die "Cannot open $cname:$!"; my $dname = $cname; my $hname = $cname; my ($doC,$doEnc,$doUcm,$doPet); if ($cname =~ /\.(c|xs)$/i) # VMS may have upcased filenames with DECC$ARGV_PARSE_STYLE defined { $doC = 1; $dname =~ s/(\.[^\.]*)?$/.exh/; chmod(0666,$dname) if -f $cname && !-w $dname; open(D,">", $dname) || die "Cannot open $dname:$!"; $hname =~ s/(\.[^\.]*)?$/.h/; chmod(0666,$hname) if -f $cname && !-w $hname; open(H,">", $hname) || die "Cannot open $hname:$!"; foreach my $fh (\*C,\*D,\*H) { print $fh <<"END" unless $opt{'q'}; /* !!!!!!! DO NOT EDIT THIS FILE !!!!!!! This file was autogenerated by: $^X $0 @orig_ARGV enc2xs VERSION $VERSION */ END } if ($cname =~ /(\w+)\.xs$/) { print C "#define PERL_NO_GET_CONTEXT\n"; print C "#include <EXTERN.h>\n"; print C "#include <perl.h>\n"; print C "#include <XSUB.h>\n"; } print C "#include \"encode.h\"\n\n"; } elsif ($cname =~ /\.enc$/) { $doEnc = 1; } elsif ($cname =~ /\.ucm$/) { $doUcm = 1; } elsif ($cname =~ /\.pet$/) { $doPet = 1; } my %encoding; my %strings; my $string_acc; my %strings_in_acc; my $saved = 0; my $subsave = 0; my $strings = 0; sub cmp_name { if ($a =~ /^.*-(\d+)/) { my $an = $1; if ($b =~ /^.*-(\d+)/) { my $r = $an <=> $1; return $r if $r; } } return $a cmp $b; } foreach my $enc (sort cmp_name @encfiles) { my ($name,$sfx) = $enc =~ /^.*?([\w-]+)\.(enc|ucm)$/; $name = $opt{'n'} if exists $opt{'n'}; if (open(E,$enc)) { if ($sfx eq 'enc') { compile_enc(\*E,lc($name)); } else { compile_ucm(\*E,lc($name)); } } else { warn "Cannot open $enc for $name:$!"; } } if ($doC) { verbose "Writing compiled form\n"; foreach my $name (sort cmp_name keys %encoding) { my ($e2u,$u2e,$erep,$min_el,$max_el) = @{$encoding{$name}}; process($name.'_utf8',$e2u); addstrings(\*C,$e2u); process('utf8_'.$name,$u2e); addstrings(\*C,$u2e); } outbigstring(\*C,"enctable"); foreach my $name (sort cmp_name keys %encoding) { my ($e2u,$u2e,$erep,$min_el,$max_el) = @{$encoding{$name}}; outtable(\*C,$e2u, "enctable"); outtable(\*C,$u2e, "enctable"); # push(@{$encoding{$name}},outstring(\*C,$e2u->{Cname}.'_def',$erep)); } my ($cpp) = compiler_info(0); my $ext = $cpp ? 'extern "C"' : "extern"; my $exta = $cpp ? 'extern "C"' : "static"; my $extb = $cpp ? 'extern "C"' : ""; foreach my $enc (sort cmp_name keys %encoding) { # my ($e2u,$u2e,$rep,$min_el,$max_el,$rsym) = @{$encoding{$enc}}; my ($e2u,$u2e,$rep,$min_el,$max_el) = @{$encoding{$enc}}; #my @info = ($e2u->{Cname},$u2e->{Cname},$rsym,length($rep),$min_el,$max_el); my $replen = 0; $replen++ while($rep =~ /\G\\x[0-9A-Fa-f]/g); my $sym = "${enc}_encoding"; $sym =~ s/\W+/_/g; my @info = ($e2u->{Cname},$u2e->{Cname},"${sym}_rep_character",$replen, $min_el,$max_el); print C "${exta} const U8 ${sym}_rep_character[] = \"$rep\";\n"; print C "${exta} const char ${sym}_enc_name[] = \"$enc\";\n\n"; print C "${extb} const encode_t $sym = \n"; # This is to make null encoding work -- dankogai for (my $i = (scalar @info) - 1; $i >= 0; --$i){ $info[$i] ||= 1; } # end of null tweak -- dankogai print C " {",join(',',@info,"{${sym}_enc_name,(const char *)0}"),"};\n\n"; } foreach my $enc (sort cmp_name keys %encoding) { my $sym = "${enc}_encoding"; $sym =~ s/\W+/_/g; print H "${ext} encode_t $sym;\n"; print D " Encode_XSEncoding(aTHX_ &$sym);\n"; } if ($cname =~ /(\w+)\.xs$/) { my $mod = $1; print C <<'END'; static void Encode_XSEncoding(pTHX_ encode_t *enc) { dSP; HV *stash = gv_stashpv("Encode::XS", TRUE); SV *iv = newSViv(PTR2IV(enc)); SV *sv = sv_bless(newRV_noinc(iv),stash); int i = 0; /* with the SvLEN() == 0 hack, PVX won't be freed. We cast away name's constness, in the hope that perl won't mess with it. */ assert(SvTYPE(iv) >= SVt_PV); assert(SvLEN(iv) == 0); SvFLAGS(iv) |= SVp_POK; SvPVX(iv) = (char*) enc->name[0]; PUSHMARK(sp); XPUSHs(sv); while (enc->name[i]) { const char *name = enc->name[i++]; XPUSHs(sv_2mortal(newSVpvn(name,strlen(name)))); } PUTBACK; call_pv("Encode::define_encoding",G_DISCARD); SvREFCNT_dec(sv); } END print C "\nMODULE = Encode::$mod\tPACKAGE = Encode::$mod\n\n"; print C "BOOT:\n{\n"; print C "#include \"$dname\"\n"; print C "}\n"; } # Close in void context is bad, m'kay close(D) or warn "Error closing '$dname': $!"; close(H) or warn "Error closing '$hname': $!"; my $perc_saved = $saved/($strings + $saved) * 100; my $perc_subsaved = $subsave/($strings + $subsave) * 100; verbosef "%d bytes in string tables\n",$strings; verbosef "%d bytes (%.3g%%) saved spotting duplicates\n", $saved, $perc_saved if $saved; verbosef "%d bytes (%.3g%%) saved using substrings\n", $subsave, $perc_subsaved if $subsave; } elsif ($doEnc) { foreach my $name (sort cmp_name keys %encoding) { my ($e2u,$u2e,$erep,$min_el,$max_el) = @{$encoding{$name}}; output_enc(\*C,$name,$e2u); } } elsif ($doUcm) { foreach my $name (sort cmp_name keys %encoding) { my ($e2u,$u2e,$erep,$min_el,$max_el) = @{$encoding{$name}}; output_ucm(\*C,$name,$u2e,$erep,$min_el,$max_el); } } # writing half meg files and then not checking to see if you just filled the # disk is bad, m'kay close(C) or die "Error closing '$cname': $!"; # End of the main program. sub compile_ucm { my ($fh,$name) = @_; my $e2u = {}; my $u2e = {}; my $cs; my %attr; while (<$fh>) { s/#.*$//; last if /^\s*CHARMAP\s*$/i; if (/^\s*<(\w+)>\s+"?([^"]*)"?\s*$/i) # " # Grrr { $attr{$1} = $2; } } if (!defined($cs = $attr{'code_set_name'})) { warn "No <code_set_name> in $name\n"; } else { $name = $cs unless exists $opt{'n'}; } my $erep; my $urep; my $max_el; my $min_el; if (exists $attr{'subchar'}) { #my @byte; #$attr{'subchar'} =~ /^\s*/cg; #push(@byte,$1) while $attr{'subchar'} =~ /\G\\x([0-9a-f]+)/icg; #$erep = join('',map(chr(hex($_)),@byte)); $erep = $attr{'subchar'}; $erep =~ s/^\s+//; $erep =~ s/\s+$//; } print "Reading $name ($cs)\n" unless defined $ENV{MAKEFLAGS} and $ENV{MAKEFLAGS} =~ /\b(s|silent|quiet)\b/; my $nfb = 0; my $hfb = 0; while (<$fh>) { s/#.*$//; last if /^\s*END\s+CHARMAP\s*$/i; next if /^\s*$/; my (@uni, @byte) = (); my ($uni, $byte, $fb) = m/^(\S+)\s+(\S+)\s+(\S+)\s+/o or die "Bad line: $_"; while ($uni =~ m/\G<([U0-9a-fA-F\+]+)>/g){ push @uni, map { substr($_, 1) } split(/\+/, $1); } while ($byte =~ m/\G\\x([0-9a-fA-F]+)/g){ push @byte, $1; } if (@uni) { my $uch = join('', map { encode_U(hex($_)) } @uni ); my $ech = join('',map(chr(hex($_)),@byte)); my $el = length($ech); $max_el = $el if (!defined($max_el) || $el > $max_el); $min_el = $el if (!defined($min_el) || $el < $min_el); if (length($fb)) { $fb = substr($fb,1); $hfb++; } else { $nfb++; $fb = '0'; } # $fb is fallback flag # 0 - round trip safe # 1 - fallback for unicode -> enc # 2 - skip sub-char mapping # 3 - fallback enc -> unicode enter($u2e,$uch,$ech,$u2e,$fb+0) if ($fb =~ /[01]/); enter($e2u,$ech,$uch,$e2u,$fb+0) if ($fb =~ /[03]/); } else { warn $_; } } if ($nfb && $hfb) { die "$nfb entries without fallback, $hfb entries with\n"; } $encoding{$name} = [$e2u,$u2e,$erep,$min_el,$max_el]; } sub compile_enc { my ($fh,$name) = @_; my $e2u = {}; my $u2e = {}; my $type; while ($type = <$fh>) { last if $type !~ /^\s*#/; } chomp($type); return if $type eq 'E'; # Do the hash lookup once, rather than once per function call. 4% speedup. my $type_func = $encode_types{$type}; my ($def,$sym,$pages) = split(/\s+/,scalar(<$fh>)); warn "$type encoded $name\n"; my $rep = ''; # Save a defined test by setting these to defined values. my $min_el = ~0; # A very big integer my $max_el = 0; # Anything must be longer than 0 { my $v = hex($def); $rep = &$type_func($v & 0xFF, ($v >> 8) & 0xffe); } my $errors; my $seen; # use -Q to silence the seen test. Makefile.PL uses this by default. $seen = {} unless $opt{Q}; do { my $line = <$fh>; chomp($line); my $page = hex($line); my $ch = 0; my $i = 16; do { # So why is it 1% faster to leave the my here? my $line = <$fh>; $line =~ s/\r\n$/\n/; die "$.:${line}Line should be exactly 65 characters long including newline (".length($line).")" unless length ($line) == 65; # Split line into groups of 4 hex digits, convert groups to ints # This takes 65.35 # map {hex $_} $line =~ /(....)/g # This takes 63.75 (2.5% less time) # unpack "n*", pack "H*", $line # There's an implicit loop in map. Loops are bad, m'kay. Ops are bad, m'kay # Doing it as while ($line =~ /(....)/g) took 74.63 foreach my $val (unpack "n*", pack "H*", $line) { next if $val == 0xFFFD; my $ech = &$type_func($ch,$page); if ($val || (!$ch && !$page)) { my $el = length($ech); $max_el = $el if $el > $max_el; $min_el = $el if $el < $min_el; my $uch = encode_U($val); if ($seen) { # We're doing the test. # We don't need to read this quickly, so storing it as a scalar, # rather than 3 (anon array, plus the 2 scalars it holds) saves # RAM and may make us faster on low RAM systems. [see __END__] if (exists $seen->{$uch}) { warn sprintf("U%04X is %02X%02X and %04X\n", $val,$page,$ch,$seen->{$uch}); $errors++; } else { $seen->{$uch} = $page << 8 | $ch; } } # Passing 2 extra args each time is 3.6% slower! # Even with having to add $fallback ||= 0 later enter_fb0($e2u,$ech,$uch); enter_fb0($u2e,$uch,$ech); } else { # No character at this position # enter($e2u,$ech,undef,$e2u); } $ch++; } } while --$i; } while --$pages; die "\$min_el=$min_el, \$max_el=$max_el - seems we read no lines" if $min_el > $max_el; die "$errors mapping conflicts\n" if ($errors && $opt{'S'}); $encoding{$name} = [$e2u,$u2e,$rep,$min_el,$max_el]; } # my ($a,$s,$d,$t,$fb) = @_; sub enter { my ($current,$inbytes,$outbytes,$next,$fallback) = @_; # state we shift to after this (multibyte) input character defaults to same # as current state. $next ||= $current; # Making sure it is defined seems to be faster than {no warnings;} in # &process, or passing it in as 0 explicitly. # XXX $fallback ||= 0; # Start at the beginning and work forwards through the string to zero. # effectively we are removing 1 character from the front each time # but we don't actually edit the string. [this alone seems to be 14% speedup] # Hence -$pos is the length of the remaining string. my $pos = -length $inbytes; while (1) { my $byte = substr $inbytes, $pos, 1; # RAW_NEXT => 0, # RAW_IN_LEN => 1, # RAW_OUT_BYTES => 2, # RAW_FALLBACK => 3, # to unicode an array would seem to be better, because the pages are dense. # from unicode can be very sparse, favouring a hash. # hash using the bytes (all length 1) as keys rather than ord value, # as it's easier to sort these in &process. # It's faster to always add $fallback even if it's undef, rather than # choosing between 3 and 4 element array. (hence why we set it defined # above) my $do_now = $current->{Raw}{$byte} ||= [{},-$pos,'',$fallback]; # When $pos was -1 we were at the last input character. unless (++$pos) { $do_now->[RAW_OUT_BYTES] = $outbytes; $do_now->[RAW_NEXT] = $next; return; } # Tail recursion. The intermediate state may not have a name yet. $current = $do_now->[RAW_NEXT]; } } # This is purely for optimisation. It's just &enter hard coded for $fallback # of 0, using only a 3 entry array ref to save memory for every entry. sub enter_fb0 { my ($current,$inbytes,$outbytes,$next) = @_; $next ||= $current; my $pos = -length $inbytes; while (1) { my $byte = substr $inbytes, $pos, 1; my $do_now = $current->{Raw}{$byte} ||= [{},-$pos,'']; unless (++$pos) { $do_now->[RAW_OUT_BYTES] = $outbytes; $do_now->[RAW_NEXT] = $next; return; } $current = $do_now->[RAW_NEXT]; } } sub process { my ($name,$a) = @_; $name =~ s/\W+/_/g; $a->{Cname} = $name; my $raw = $a->{Raw}; my ($l, $agg_max_in, $agg_next, $agg_in_len, $agg_out_len, $agg_fallback); my @ent; $agg_max_in = 0; foreach my $key (sort keys %$raw) { # RAW_NEXT => 0, # RAW_IN_LEN => 1, # RAW_OUT_BYTES => 2, # RAW_FALLBACK => 3, my ($next, $in_len, $out_bytes, $fallback) = @{$raw->{$key}}; # Now we are converting from raw to aggregate, switch from 1 byte strings # to numbers my $b = ord $key; $fallback ||= 0; if ($l && # If this == fails, we're going to reset $agg_max_in below anyway. $b == ++$agg_max_in && # References in numeric context give the pointer as an int. $agg_next == $next && $agg_in_len == $in_len && $agg_out_len == length $out_bytes && $agg_fallback == $fallback # && length($l->[AGG_OUT_BYTES]) < 16 ) { # my $i = ord($b)-ord($l->[AGG_MIN_IN]); # we can aggregate this byte onto the end. $l->[AGG_MAX_IN] = $b; $l->[AGG_OUT_BYTES] .= $out_bytes; } else { # AGG_MIN_IN => 0, # AGG_MAX_IN => 1, # AGG_OUT_BYTES => 2, # AGG_NEXT => 3, # AGG_IN_LEN => 4, # AGG_OUT_LEN => 5, # AGG_FALLBACK => 6, # Reset the last thing we saw, plus set 5 lexicals to save some derefs. # (only gains .6% on euc-jp -- is it worth it?) push @ent, $l = [$b, $agg_max_in = $b, $out_bytes, $agg_next = $next, $agg_in_len = $in_len, $agg_out_len = length $out_bytes, $agg_fallback = $fallback]; } if (exists $next->{Cname}) { $next->{'Forward'} = 1 if $next != $a; } else { process(sprintf("%s_%02x",$name,$b),$next); } } # encengine.c rules say that last entry must be for 255 if ($agg_max_in < 255) { push @ent, [1+$agg_max_in, 255,undef,$a,0,0]; } $a->{'Entries'} = \@ent; } sub addstrings { my ($fh,$a) = @_; my $name = $a->{'Cname'}; # String tables foreach my $b (@{$a->{'Entries'}}) { next unless $b->[AGG_OUT_LEN]; $strings{$b->[AGG_OUT_BYTES]} = undef; } if ($a->{'Forward'}) { my ($cpp, $static, $sized) = compiler_info(1); my $count = $sized ? scalar(@{$a->{'Entries'}}) : ''; if ($static) { # we cannot ask Config for d_plusplus since we can override CC=g++-6 on the cmdline print $fh "#ifdef __cplusplus\n"; # -fpermissive since g++-6 print $fh "extern encpage_t $name\[$count];\n"; print $fh "#else\n"; print $fh "static const encpage_t $name\[$count];\n"; print $fh "#endif\n"; } else { print $fh "extern encpage_t $name\[$count];\n"; } } $a->{'DoneStrings'} = 1; foreach my $b (@{$a->{'Entries'}}) { my ($s,$e,$out,$t,$end,$l) = @$b; addstrings($fh,$t) unless $t->{'DoneStrings'}; } } sub outbigstring { my ($fh,$name) = @_; $string_acc = ''; # Make the big string in the string accumulator. Longest first, on the hope # that this makes it more likely that we find the short strings later on. # Not sure if it helps sorting strings of the same length lexically. foreach my $s (sort {length $b <=> length $a || $a cmp $b} keys %strings) { my $index = index $string_acc, $s; if ($index >= 0) { $saved += length($s); $strings_in_acc{$s} = $index; } else { OPTIMISER: { if ($opt{'O'}) { my $sublength = length $s; while (--$sublength > 0) { # progressively lop characters off the end, to see if the start of # the new string overlaps the end of the accumulator. if (substr ($string_acc, -$sublength) eq substr ($s, 0, $sublength)) { $subsave += $sublength; $strings_in_acc{$s} = length ($string_acc) - $sublength; # append the last bit on the end. $string_acc .= substr ($s, $sublength); last OPTIMISER; } # or if the end of the new string overlaps the start of the # accumulator next unless substr ($string_acc, 0, $sublength) eq substr ($s, -$sublength); # well, the last $sublength characters of the accumulator match. # so as we're prepending to the accumulator, need to shift all our # existing offsets forwards $_ += $sublength foreach values %strings_in_acc; $subsave += $sublength; $strings_in_acc{$s} = 0; # append the first bit on the start. $string_acc = substr ($s, 0, -$sublength) . $string_acc; last OPTIMISER; } } # Optimiser (if it ran) found nothing, so just going have to tack the # whole thing on the end. $strings_in_acc{$s} = length $string_acc; $string_acc .= $s; }; } } $strings = length $string_acc; my ($cpp) = compiler_info(0); my $var = $cpp ? '' : 'static'; my $definition = "\n$var const U8 $name\[$strings] = { " . join(',',unpack "C*",$string_acc); # We have a single long line. Split it at convenient commas. print $fh $1, "\n" while $definition =~ /\G(.{74,77},)/gcs; print $fh substr ($definition, pos $definition), " };\n"; } sub findstring { my ($name,$s) = @_; my $offset = $strings_in_acc{$s}; die "Can't find string " . join (',',unpack "C*",$s) . " in accumulator" unless defined $offset; "$name + $offset"; } sub outtable { my ($fh,$a,$bigname) = @_; my $name = $a->{'Cname'}; $a->{'Done'} = 1; foreach my $b (@{$a->{'Entries'}}) { my ($s,$e,$out,$t,$end,$l) = @$b; outtable($fh,$t,$bigname) unless $t->{'Done'}; } my ($cpp, $static) = compiler_info(0); my $count = scalar(@{$a->{'Entries'}}); if ($static) { print $fh "#ifdef __cplusplus\n"; # -fpermissive since g++-6 print $fh "encpage_t $name\[$count] = {\n"; print $fh "#else\n"; print $fh "static const encpage_t $name\[$count] = {\n"; print $fh "#endif\n"; } else { print $fh "\nencpage_t $name\[$count] = {\n"; } foreach my $b (@{$a->{'Entries'}}) { my ($sc,$ec,$out,$t,$end,$l,$fb) = @$b; # $end |= 0x80 if $fb; # what the heck was on your mind, Nick? -- Dan print $fh "{"; if ($l) { printf $fh findstring($bigname,$out); } else { print $fh "0"; } print $fh ",",$t->{Cname}; printf $fh ",0x%02x,0x%02x,$l,$end},\n",$sc,$ec; } print $fh "};\n"; } sub output_enc { my ($fh,$name,$a) = @_; die "Changed - fix me for new structure"; foreach my $b (sort keys %$a) { my ($s,$e,$out,$t,$end,$l,$fb) = @{$a->{$b}}; } } sub decode_U { my $s = shift; } my @uname; sub char_names { my $s = do "unicore/Name.pl"; die "char_names: unicore/Name.pl: $!\n" unless defined $s; pos($s) = 0; while ($s =~ /\G([0-9a-f]+)\t([0-9a-f]*)\t(.*?)\s*\n/igc) { my $name = $3; my $s = hex($1); last if $s >= 0x10000; my $e = length($2) ? hex($2) : $s; for (my $i = $s; $i <= $e; $i++) { $uname[$i] = $name; # print sprintf("U%04X $name\n",$i); } } } sub output_ucm_page { my ($cmap,$a,$t,$pre) = @_; # warn sprintf("Page %x\n",$pre); my $raw = $t->{Raw}; foreach my $key (sort keys %$raw) { # RAW_NEXT => 0, # RAW_IN_LEN => 1, # RAW_OUT_BYTES => 2, # RAW_FALLBACK => 3, my ($next, $in_len, $out_bytes, $fallback) = @{$raw->{$key}}; my $u = ord $key; $fallback ||= 0; if ($next != $a && $next != $t) { output_ucm_page($cmap,$a,$next,(($pre|($u &0x3F)) << 6)&0xFFFF); } elsif (length $out_bytes) { if ($pre) { $u = $pre|($u &0x3f); } my $s = sprintf "<U%04X> ",$u; #foreach my $c (split(//,$out_bytes)) { # $s .= sprintf "\\x%02X",ord($c); #} # 9.5% faster changing that loop to this: $s .= sprintf +("\\x%02X" x length $out_bytes), unpack "C*", $out_bytes; $s .= sprintf " |%d # %s\n",($fallback ? 1 : 0),$uname[$u]; push(@$cmap,$s); } else { warn join(',',$u, @{$raw->{$key}},$a,$t); } } } sub output_ucm { my ($fh,$name,$h,$rep,$min_el,$max_el) = @_; print $fh "# $0 @orig_ARGV\n" unless $opt{'q'}; print $fh "<code_set_name> \"$name\"\n"; char_names(); if (defined $min_el) { print $fh "<mb_cur_min> $min_el\n"; } if (defined $max_el) { print $fh "<mb_cur_max> $max_el\n"; } if (defined $rep) { print $fh "<subchar> "; foreach my $c (split(//,$rep)) { printf $fh "\\x%02X",ord($c); } print $fh "\n"; } my @cmap; output_ucm_page(\@cmap,$h,$h,0); print $fh "#\nCHARMAP\n"; foreach my $line (sort { substr($a,8) cmp substr($b,8) } @cmap) { print $fh $line; } print $fh "END CHARMAP\n"; } use vars qw( $_Enc2xs $_Version $_Inc $_E2X $_Name $_TableFiles $_Now ); sub find_e2x{ eval { require File::Find; }; my (@inc, %e2x_dir); for my $inc (@INC){ push @inc, $inc unless $inc eq '.'; #skip current dir } File::Find::find( sub { my ($dev,$ino,$mode,$nlink,$uid,$gid,$rdev,$size, $atime,$mtime,$ctime,$blksize,$blocks) = lstat($_) or return; -f _ or return; if (/^.*\.e2x$/o){ no warnings 'once'; $e2x_dir{$File::Find::dir} ||= $mtime; } return; }, @inc); warn join("\n", keys %e2x_dir), "\n"; for my $d (sort {$e2x_dir{$a} <=> $e2x_dir{$b}} keys %e2x_dir){ $_E2X = $d; # warn "$_E2X => ", scalar localtime($e2x_dir{$d}); return $_E2X; } } sub make_makefile_pl { eval { require Encode } or die "You need to install Encode to use enc2xs -M\nerror: $@\n"; # our used for variable expansion $_Enc2xs = $0; $_Version = $VERSION; $_E2X = find_e2x(); $_Name = shift; $_TableFiles = join(",", map {qq('$_')} @_); $_Now = scalar localtime(); eval { require File::Spec; }; _print_expand(File::Spec->catfile($_E2X,"Makefile_PL.e2x"),"Makefile.PL"); _print_expand(File::Spec->catfile($_E2X,"_PM.e2x"), "$_Name.pm"); _print_expand(File::Spec->catfile($_E2X,"_T.e2x"), "t/$_Name.t"); _print_expand(File::Spec->catfile($_E2X,"README.e2x"), "README"); _print_expand(File::Spec->catfile($_E2X,"Changes.e2x"), "Changes"); exit; } use vars qw( $_ModLines $_LocalVer ); sub make_configlocal_pm { eval { require Encode } or die "Unable to require Encode: $@\n"; eval { require File::Spec; }; # our used for variable expantion my %in_core = map { $_ => 1 } ( 'ascii', 'iso-8859-1', 'utf8', 'ascii-ctrl', 'null', 'utf-8-strict' ); my %LocalMod = (); # check @enc; use File::Find (); my $wanted = sub{ -f $_ or return; $File::Find::name =~ /\A\./ and return; $File::Find::name =~ /\.pm\z/ or return; $File::Find::name =~ m/\bEncode\b/ or return; my $mod = $File::Find::name; $mod =~ s/.*\bEncode\b/Encode/o; $mod =~ s/\.pm\z//o; $mod =~ s,/,::,og; eval qq{ require $mod; } or return; warn qq{ require $mod;\n}; for my $enc ( Encode->encodings() ) { no warnings; $in_core{$enc} and next; $Encode::Config::ExtModule{$enc} and next; $LocalMod{$enc} ||= $mod; } }; File::Find::find({wanted => $wanted}, @INC); $_ModLines = ""; for my $enc ( sort keys %LocalMod ) { $_ModLines .= qq(\$Encode::ExtModule{'$enc'} = "$LocalMod{$enc}";\n); } warn $_ModLines if $_ModLines; $_LocalVer = _mkversion(); $_E2X = find_e2x(); $_Inc = $INC{"Encode.pm"}; $_Inc =~ s/\.pm$//o; _print_expand( File::Spec->catfile( $_E2X, "ConfigLocal_PM.e2x" ), File::Spec->catfile( $_Inc, "ConfigLocal.pm" ), 1 ); exit; } sub _mkversion{ # v-string is now depreciated; use time() instead; #my ($ss,$mm,$hh,$dd,$mo,$yyyy) = localtime(); #$yyyy += 1900, $mo +=1; #return sprintf("v%04d.%04d.%04d", $yyyy, $mo*100+$dd, $hh*100+$mm); return time(); } sub _print_expand{ eval { require File::Basename } or die "File::Basename needed. Are you on miniperl?;\nerror: $@\n"; File::Basename->import(); my ($src, $dst, $clobber) = @_; if (!$clobber and -e $dst){ warn "$dst exists. skipping\n"; return; } warn "Generating $dst...\n"; open my $in, $src or die "$src : $!"; if ((my $d = dirname($dst)) ne '.'){ -d $d or mkdir $d, 0755 or die "mkdir $d : $!"; } open my $out, ">", $dst or die "$!"; my $asis = 0; while (<$in>){ if (/^#### END_OF_HEADER/){ $asis = 1; next; } s/(\$_[A-Z][A-Za-z0-9]+)_/$1/gee unless $asis; print $out $_; } } __END__ =head1 NAME enc2xs -- Perl Encode Module Generator =head1 SYNOPSIS enc2xs -[options] enc2xs -M ModName mapfiles... enc2xs -C =head1 DESCRIPTION F<enc2xs> builds a Perl extension for use by Encode from either Unicode Character Mapping files (.ucm) or Tcl Encoding Files (.enc). Besides being used internally during the build process of the Encode module, you can use F<enc2xs> to add your own encoding to perl. No knowledge of XS is necessary. =head1 Quick Guide If you want to know as little about Perl as possible but need to add a new encoding, just read this chapter and forget the rest. =over 4 =item 0.Z<> Have a .ucm file ready. You can get it from somewhere or you can write your own from scratch or you can grab one from the Encode distribution and customize it. For the UCM format, see the next Chapter. In the example below, I'll call my theoretical encoding myascii, defined in I<my.ucm>. C<$> is a shell prompt. $ ls -F my.ucm =item 1.Z<> Issue a command as follows; $ enc2xs -M My my.ucm generating Makefile.PL generating My.pm generating README generating Changes Now take a look at your current directory. It should look like this. $ ls -F Makefile.PL My.pm my.ucm t/ The following files were created. Makefile.PL - MakeMaker script My.pm - Encode submodule t/My.t - test file =over 4 =item 1.1.Z<> If you want *.ucm installed together with the modules, do as follows; $ mkdir Encode $ mv *.ucm Encode $ enc2xs -M My Encode/*ucm =back =item 2.Z<> Edit the files generated. You don't have to if you have no time AND no intention to give it to someone else. But it is a good idea to edit the pod and to add more tests. =item 3.Z<> Now issue a command all Perl Mongers love: $ perl Makefile.PL Writing Makefile for Encode::My =item 4.Z<> Now all you have to do is make. $ make cp My.pm blib/lib/Encode/My.pm /usr/local/bin/perl /usr/local/bin/enc2xs -Q -O \ -o encode_t.c -f encode_t.fnm Reading myascii (myascii) Writing compiled form 128 bytes in string tables 384 bytes (75%) saved spotting duplicates 1 bytes (0.775%) saved using substrings .... chmod 644 blib/arch/auto/Encode/My/My.bs $ The time it takes varies depending on how fast your machine is and how large your encoding is. Unless you are working on something big like euc-tw, it won't take too long. =item 5.Z<> You can "make install" already but you should test first. $ make test PERL_DL_NONLAZY=1 /usr/local/bin/perl -Iblib/arch -Iblib/lib \ -e 'use Test::Harness qw(&runtests $verbose); \ $verbose=0; runtests @ARGV;' t/*.t t/My....ok All tests successful. Files=1, Tests=2, 0 wallclock secs ( 0.09 cusr + 0.01 csys = 0.09 CPU) =item 6.Z<> If you are content with the test result, just "make install" =item 7.Z<> If you want to add your encoding to Encode's demand-loading list (so you don't have to "use Encode::YourEncoding"), run enc2xs -C to update Encode::ConfigLocal, a module that controls local settings. After that, "use Encode;" is enough to load your encodings on demand. =back =head1 The Unicode Character Map Encode uses the Unicode Character Map (UCM) format for source character mappings. This format is used by IBM's ICU package and was adopted by Nick Ing-Simmons for use with the Encode module. Since UCM is more flexible than Tcl's Encoding Map and far more user-friendly, this is the recommended format for Encode now. A UCM file looks like this. # # Comments # <code_set_name> "US-ascii" # Required <code_set_alias> "ascii" # Optional <mb_cur_min> 1 # Required; usually 1 <mb_cur_max> 1 # Max. # of bytes/char <subchar> \x3F # Substitution char # CHARMAP <U0000> \x00 |0 # <control> <U0001> \x01 |0 # <control> <U0002> \x02 |0 # <control> .... <U007C> \x7C |0 # VERTICAL LINE <U007D> \x7D |0 # RIGHT CURLY BRACKET <U007E> \x7E |0 # TILDE <U007F> \x7F |0 # <control> END CHARMAP =over 4 =item * Anything that follows C<#> is treated as a comment. =item * The header section continues until a line containing the word CHARMAP. This section has a form of I<E<lt>keywordE<gt> value>, one pair per line. Strings used as values must be quoted. Barewords are treated as numbers. I<\xXX> represents a byte. Most of the keywords are self-explanatory. I<subchar> means substitution character, not subcharacter. When you decode a Unicode sequence to this encoding but no matching character is found, the byte sequence defined here will be used. For most cases, the value here is \x3F; in ASCII, this is a question mark. =item * CHARMAP starts the character map section. Each line has a form as follows: <UXXXX> \xXX.. |0 # comment ^ ^ ^ | | +- Fallback flag | +-------- Encoded byte sequence +-------------- Unicode Character ID in hex The format is roughly the same as a header section except for the fallback flag: | followed by 0..3. The meaning of the possible values is as follows: =over 4 =item |0 Round trip safe. A character decoded to Unicode encodes back to the same byte sequence. Most characters have this flag. =item |1 Fallback for unicode -> encoding. When seen, enc2xs adds this character for the encode map only. =item |2 Skip sub-char mapping should there be no code point. =item |3 Fallback for encoding -> unicode. When seen, enc2xs adds this character for the decode map only. =back =item * And finally, END OF CHARMAP ends the section. =back When you are manually creating a UCM file, you should copy ascii.ucm or an existing encoding which is close to yours, rather than write your own from scratch. When you do so, make sure you leave at least B<U0000> to B<U0020> as is, unless your environment is EBCDIC. B<CAVEAT>: not all features in UCM are implemented. For example, icu:state is not used. Because of that, you need to write a perl module if you want to support algorithmical encodings, notably the ISO-2022 series. Such modules include L<Encode::JP::2022_JP>, L<Encode::KR::2022_KR>, and L<Encode::TW::HZ>. =head2 Coping with duplicate mappings When you create a map, you SHOULD make your mappings round-trip safe. That is, C<encode('your-encoding', decode('your-encoding', $data)) eq $data> stands for all characters that are marked as C<|0>. Here is how to make sure: =over 4 =item * Sort your map in Unicode order. =item * When you have a duplicate entry, mark either one with '|1' or '|3'. =item * And make sure the '|1' or '|3' entry FOLLOWS the '|0' entry. =back Here is an example from big5-eten. <U2550> \xF9\xF9 |0 <U2550> \xA2\xA4 |3 Internally Encoding -> Unicode and Unicode -> Encoding Map looks like this; E to U U to E -------------------------------------- \xF9\xF9 => U2550 U2550 => \xF9\xF9 \xA2\xA4 => U2550 So it is round-trip safe for \xF9\xF9. But if the line above is upside down, here is what happens. E to U U to E -------------------------------------- \xA2\xA4 => U2550 U2550 => \xF9\xF9 (\xF9\xF9 => U2550 is now overwritten!) The Encode package comes with F<ucmlint>, a crude but sufficient utility to check the integrity of a UCM file. Check under the Encode/bin directory for this. When in doubt, you can use F<ucmsort>, yet another utility under Encode/bin directory. =head1 Bookmarks =over 4 =item * ICU Home Page L<http://www.icu-project.org/> =item * ICU Character Mapping Tables L<http://site.icu-project.org/charts/charset> =item * ICU:Conversion Data L<http://www.icu-project.org/userguide/conversion-data.html> =back =head1 SEE ALSO L<Encode>, L<perlmod>, L<perlpod> =cut # -Q to disable the duplicate codepoint test # -S make mapping errors fatal # -q to remove comments written to output files # -O to enable the (brute force) substring optimiser # -o <output> to specify the output file name (else it's the first arg) # -f <inlist> to give a file with a list of input files (else use the args) # -n <name> to name the encoding (else use the basename of the input file. With %seen holding array refs: 865.66 real 28.80 user 8.79 sys 7904 maximum resident set size 1356 average shared memory size 18566 average unshared data size 229 average unshared stack size 46080 page reclaims 33373 page faults With %seen holding simple scalars: 342.16 real 27.11 user 3.54 sys 8388 maximum resident set size 1394 average shared memory size 14969 average unshared data size 236 average unshared stack size 28159 page reclaims 9839 page faults Yes, 5 minutes is faster than 15. Above is for CP936 in CN. Only difference is how %seen is storing things its seen. So it is pathalogically bad on a 16M RAM machine, but it's going to help even on modern machines. Swapping is bad, m'kay :-)
SILENT KILLER Tool