To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?純ヲ??コ?功ぃ?ぁ 0011111110001111100000111000001110010010001111110011111110000011010100100011111110001100111101111000001010100001001111111000001010011111 3f8f8383923f3f83523f8cf782a13f829f
EUC-JP ?純ヲ??コ?功ぃ?ぁ 0011111110111101111000111010010111110010001111110011111110100101101100110011111110111000111110011010010010100011001111111010010010100001 3fbde3a5f23f3fa5b33fb8f9a4a33fa4a1
UTF-8 룶純ヲ룶殺コ룴功ぃ룶ぁ 111010111010001110110110111001111011010010010100111000111000001110110010111010111010001110110110111011111010010110110000111000111000001010110011111010111010001110110100111001011000101010011111111000111000000110000011111010111010001110110110111000111000000110000001 eba3b6e7b494e383b2eba3b6efa5b0e382b3eba3b4e58a9fe38183eba3b6e38181
UHC 룶純ヲ룶殺コ룴功ぃ룶ぁ 10001111101010111110001011101101101010111111001010001111101010111110000111101101101010111011001110001111101010011100110111101101101010101010001110001111101010111010101010100001 8fabe2edabf28fabe1edabb38fa9cdedaaa38fabaaa1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)