To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????h?????? 00111111001111110011111100111111001111110011111101101000001111110011111100111111001111110011111100111111 3f3f3f3f3f3f683f3f3f3f3f3f
SJIS-WIN 谷贈其狸造俗h谷贈其狸造俗 10010010010010101001000110100001100100011011010010010010010010111001000110100010100100011010110101101000100100100100101010010001101000011001000110110100100100100100101110010001101000101001000110101101 924a91a191b4924b91a291ad68924a91a191b4924b91a291ad
EUC-JP 谷贈其狸造俗h谷贈其狸造俗 11000011101010111100001010100011110000101011011011000011101011001100001010100100110000101010111101101000110000111010101111000010101000111100001010110110110000111010110011000010101001001100001010101111 c3abc2a3c2b6c3acc2a4c2af68c3abc2a3c2b6c3acc2a4c2af
UTF-8 谷贈其狸造俗h谷贈其狸造俗 11101000101100001011011111101000101101001000100011100101100001011011011011100111100010111011100011101001100000001010000011100100101111111001011101101000111010001011000010110111111010001011010010001000111001011000010110110110111001111000101110111000111010011000000010100000111001001011111110010111 e8b0b7e8b488e585b6e78bb8e980a0e4bf9768e8b0b7e8b488e585b6e78bb8e980a0e4bf97
UHC 谷贈其狸造俗h谷贈其狸造俗 11001101110110111111000111111100110100001110110011010111111000011111000011100011111000011101010001101000110011011101101111110001111111001101000011101100110101111110000111110000111000111110000111010100 cddbf1fcd0ecd7e1f0e3e1d468cddbf1fcd0ecd7e1f0e3e1d4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)