Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 谷造他竪束尊竪息 10010010010010101001000110100010100100011011110010010010010001111001000110101001100100011011100010010010010001111001000110100111 924a91a291bc924791a991b8924791a7
EUC-JP 谷造他竪束尊竪息 11000011101010111100001010100100110000101011111011000011101010001100001010101011110000101011101011000011101010001100001010101001 c3abc2a4c2bec3a8c2abc2bac3a8c2a9
UTF-8 谷造他竪束尊竪息 111010001011000010110111111010011000000010100000111001001011101110010110111001111010101110101010111001101001110110011111111001011011000010001010111001111010101110101010111001101000000110101111 e8b0b7e980a0e4bb96e7abaae69d9fe5b08ae7abaae681af
UHC 谷造他竪束尊竪息 11001101110110111111000011100011111101101110001011100010101101011110000111010110111100001110111011100010101101011110001111010011 cddbf0e3f6e2e2b5e1d6f0eee2b5e3d3

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
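
The conversion shown in the table can be approximated with a minimal Python sketch. It is not the code behind this page. The codec names on the right of `CODECS` are assumptions about which Python codecs correspond to the charset labels above (`cp932` for SJIS-WIN, `cp949` for UHC), and `to_bit_strings` is a hypothetical helper name. Characters a charset cannot represent are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note says
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    binary = "".join(format(byte, "08b") for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("谷", codec)
        print(label, binary, hexadecimal)
```

Run on the single character 谷, the UTF-8, SJIS-WIN, and EUC-JP results should match the leading bytes of the corresponding rows in the table (e8b0b7, 924a, and c3ab).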