ALT Linux Team development discussions
From: Alexey Tourbin <at@altlinux.ru>
To: ALT Devel discussion list <devel@altlinux.org>
Subject: [devel] Re: cmdcache hash function
Date: Thu, 15 Sep 2005 13:26:51 +0400
Message-ID: <20050915092651.GT3290@solemn.turbinal.org> (raw)
In-Reply-To: <20050914171004.GO3290@solemn.turbinal.org>

On Wed, Sep 14, 2005 at 09:10:04PM +0400, Alexey Tourbin wrote:
> There are no letters, only st_ino+st_size+st_mtime.
> 
> at@solemn ~/.cmdcache 4 $ ls -1 rpmfile |head
> i10003s62465m1079085926
> i10226s323862m1079960217
> i10287s367587m1080138012
> i10366s1535556m1077197737
> i10552s68368m1077197738
> i10580s107898m1078145421
> i10723s828128m1076422307
> i1090192s13734m1125334673
> i11055s406941m1079109059
> i1166331s4611m1125335339
> at@solemn ~/.cmdcache 4 $
> 
> Besides, hashing by letters would give a poor distribution, and the
> cache is "not transparent" anyway (that is, reverse lookup is hard).
> One could add a single-level layer of directories keyed on the last
> digits of ino and size.  That gives 100 files in each of 100 directories
> for a 10k capacity.  Need to look at how ccache does it.

Okay, there are three numbers: st_ino, st_size and st_mtime.  What is
the proper way to hash them?  We need a single-level layer of
directories.  For a capacity on the order of 10k, pick a number of
directories (buckets) around 100, preferably a prime:

$ for n in `seq 50 150`; do factor $n; done |awk NF==2
53: 53
59: 59
61: 61
67: 67
71: 71
73: 73
79: 79
83: 83
89: 89
97: 97
101: 101
103: 103
107: 107
109: 109
113: 113
127: 127
131: 131
137: 137
139: 139
149: 149
$

So we hash modulo, say, 97.

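cmdcache_hash()
{
	local c_st
	# inode, size and mtime of the file (symlinks are followed)
	c_st="$(stat -L -c '%i %s %Y' -- "$1")" || return
	# split into $1=st_ino $2=st_size $3=st_mtime
	set -- $c_st
	local c_hashfile="i${1}s${2}m${3}"
	# bucket directory: sum of the three numbers modulo 97
	local c_hashdir=$(( ( $1 + $2 + $3 ) % 97 ))
	echo "$c_hashdir/$c_hashfile"
}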

bash-2.05b$ cmdcache_hash /etc/passwd
70/i22612s2957m1119749776
bash-2.05b$ exit
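
Since the entry name already carries all three numbers, the bucket can be
recomputed from the name alone, which makes it easy to check how evenly
modulo 97 spreads an existing cache.  A minimal sketch, assuming entry
names of the form iINOsSIZEmMTIME as in the quoted listing;
cmdcache_bucket and cmdcache_histogram are hypothetical helper names,
not part of cmdcache:

```shell
# Hypothetical helper: recompute the bucket from an entry name such as
# i22612s2957m1119749776 (name format assumed from the quoted listing).
cmdcache_bucket()
{
	local name="$1" ino size mtime
	name="${name#i}"      # drop the leading "i"
	ino="${name%%s*}"     # digits before "s"
	name="${name#*s}"
	size="${name%%m*}"    # digits between "s" and "m"
	mtime="${name#*m}"    # digits after "m"
	echo $(( ( $ino + $size + $mtime ) % 97 ))
}

# Hypothetical helper: read entry names on stdin and print
# "count bucket" pairs, one line per non-empty bucket.
cmdcache_histogram()
{
	local f
	while read -r f; do
		cmdcache_bucket "$f"
	done | sort -n | uniq -c
}
```

For instance, `ls -1 rpmfile | cmdcache_histogram` would show how many
entries of the flat cache land in each of the 97 buckets.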

Colleagues!  What is the proper way to hash three numbers?
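
One possible alternative to the plain sum, offered only as a sketch and
not as the answer the thread arrived at: run the key string through the
POSIX cksum CRC and reduce that modulo 97, so that every digit of all
three numbers influences the bucket.  cmdcache_hash_crc is a
hypothetical name, and `stat -c` is assumed to be the GNU coreutils
stat, as in the function above.

```shell
# Sketch: same key as cmdcache_hash, but the bucket comes from the
# CRC of the key string rather than from the sum of the three numbers.
cmdcache_hash_crc()
{
	local c_st c_crc
	c_st="$(stat -L -c '%i %s %Y' -- "$1")" || return
	set -- $c_st
	local c_hashfile="i${1}s${2}m${3}"
	# cksum on stdin prints "<crc> <byte count>"; keep only the CRC
	c_crc="$(printf '%s' "$c_hashfile" | cksum)" || return
	c_crc="${c_crc%% *}"
	echo "$(( $c_crc % 97 ))/$c_hashfile"
}
```

The price is one extra process per lookup; whether the distribution is
noticeably better than with the sum would have to be measured on a real
cache.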
