To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 旋嘶旋?旋承旋陞 100100001111100110011010011111001001000011111001001111111001000011111001100011111011001110010000111110011110100010011110 90f99a7c90f93f90f98fb390f9e89e
EUC-JP 旋嘶旋渲旋承旋陞 1100000011111011110100111101110111000000111110111000111111000111111011001100000011111011101111101011010111000000111110111110111111111110 c0fbd3ddc0fb8fc7ecc0fbbeb5c0fbeffe
UTF-8 旋嘶旋渲旋承旋陞 111001101001011110001011111001011001100010110110111001101001011110001011111001101011100010110010111001101001011110001011111001101000100110111111111001101001011110001011111010011001100110011110 e6978be598b6e6978be6b8b2e6978be689bfe6978be9999e
UHC 旋嘶旋渲旋承旋陞 11100000110000011110001110110110111000001100000111100000110000101110000011000001111000111010111111100000110000011110001110110011 e0c1e3b6e0c1e0c2e0c1e3afe0c1e3b3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)