What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厭レ?誼ゆ?純??窈 10001001011111011000001110001100001111111000101101100010100000101110010000111111100011111000001100111111001111111110001001110111 897d838c3f8b6282e43f8f833f3fe277
EUC-JP 厭レ?誼ゆ?純??窈 10110001110111101010010111101100001111111011010111000011101001001110011000111111101111011110001100111111001111111110001111011000 b1dea5ec3fb5c3a4e63fbde33f3fe3d8
UTF-8 厭レ쉶誼ゆ쉸純볦죦窈 111001011000111010101101111000111000001110101100111011001000100110110110111010001010101010111100111000111000001010000110111011001000100110111000111001111011010010010100111010111011001110100110111011001010001110100110111001111010101010001000 e58eade383acec89b6e8aabce38286ec89b8e7b494ebb3a6eca3a6e7aa88
UHC 厭レ쉶誼ゆ쉸純볦죦窈 1110011011110100101010111110110010011010100011001110101111111110101010101110011010011010100011101110001011101101100100111110110010100001100000011110100110100001 e6f4abec9a8cebfeaae69a8ee2ed93eca181e9a1

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
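
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, and the function name `encode_table` is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 codec stands in for UHC
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


for label, bits, hex_string in encode_table("レ"):
    print(label, bits, hex_string)
```

For the single character レ this yields 3f, 838c, a5ec, e383ac and abec, the same bytes that appear for that character in the table rows. Other converters may map a few characters differently, so results for unusual input are not guaranteed to agree byte for byte.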