To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 形??頭?頭?妖?? 1000110001100000001111110011111110010011101010100011111110010011101010100011111110010111011001000011111100111111 8c603f3f93aa3f93aa3f97643f3f
EUC-JP 形?祜頭?頭?妖?? 10110111110000010011111110001111110100001101100011000110101011000011111111000110101011000011111111001101110001010011111100111111 b7c13f8fd0d8c6ac3fc6ac3fcdc53f3f
UTF-8 形렠祜頭렧頭떵妖쾀렮 111001011011110110100010111010111010000010100000111001111010010110011100111010011010000010101101111010111010000010100111111010011010000010101101111010111001011010110101111001011010011010010110111011001011111010000000111010111010000010101110 e5bda2eba0a0e7a59ce9a0adeba0a7e9a0adeb96b5e5a696ecbe80eba0ae
UHC 形렠祜頭렧頭떵妖쾀렮 1111101110100001100011101011000111111011110101001101010011101001100011101011011011010100111010011011011010111010111010001110110111000100111001101000111010111011 fba18eb1fbd4d4e98eb6d4e9b6bae8edc4e68ebb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)