To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 谷贈袖奪孫他狸 1001001001001010100100011010000110010001101100111001001001000100100100011011011110010001101111001001001001001011 924a91a191b3924491b791bc924b
EUC-JP 谷贈袖奪孫他狸 1100001110101011110000101010001111000010101101011100001110100101110000101011100111000010101111101100001110101100 c3abc2a3c2b5c3a5c2b9c2bec3ac
UTF-8 谷贈袖奪孫他狸 111010001011000010110111111010001011010010001000111010001010001010010110111001011010010110101010111001011010110110101011111001001011101110010110111001111000101110111000 e8b0b7e8b488e8a296e5a5aae5adabe4bb96e78bb8
UHC 谷贈袖奪孫他狸 1100110111011011111100011111110011100010110000001111011110101100111000011101110111110110111000101101011111100001 cddbf1fce2c0f7ace1ddf6e2d7e1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)