Which bit string is a character encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????????領?????????肌 0011111100111111001111110011111100111111001111110011111100111111001111110011111110010111110011000011111100111111001111110011111100111111001111110011111100111111001111111001010010100111 3f3f3f3f3f3f3f3f3f3f97cc3f3f3f3f3f3f3f3f3f94a7
EUC-JP ??????????領?????????肌 0011111100111111001111110011111100111111001111110011111100111111001111110011111111001110110011100011111100111111001111110011111100111111001111110011111100111111001111111100100010101001 3f3f3f3f3f3f3f3f3f3fcece3f3f3f3f3f3f3f3f3fc8a9
UTF-8 렻ㅔ씽ㅔ렪렻┙렰렻┙領ㅔ씽ㅔ렪렻┙렰렻┙肌 111010111010000010111011111000111000010110010100111011001001010010111101111000111000010110010100111010111010000010101010111010111010000010111011111000101001010010011001111010111010000010110000111010111010000010111011111000101001010010011001111010011010000010011000111000111000010110010100111011001001010010111101111000111000010110010100111010111010000010101010111010111010000010111011111000101001010010011001111010111010000010110000111010111010000010111011111000101001010010011001111010001000001010001100 eba0bbe38594ec94bde38594eba0aaeba0bbe29499eba0b0eba0bbe29499e9a098e38594ec94bde38594eba0aaeba0bbe29499eba0b0eba0bbe29499e8828c
UHC 렻ㅔ씽ㅔ렪렻┙렰렻┙領ㅔ씽ㅔ렪렻┙렰렻┙肌 100011101100001110100100110001001011111011000101101001001100010010001110101110001000111011000011101001101100010010001110101111011000111011000011101001101100010011010110110001011010010011000100101111101100010110100100110001001000111010111000100011101100001110100110110001001000111010111101100011101100001110100110110001001101000110111111 8ec3a4c4bec5a4c48eb88ec3a6c48ebd8ec3a6c4d6c5a4c4bec5a4c48eb88ec3a6c48ebd8ec3a6c4d1bf
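The conversion shown in the table above can be approximated with a short Python sketch. It is not the code behind this page. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `convert` are assumptions chosen for illustration, and characters that a charset cannot represent are replaced with `?` (0x3f), matching the rows above.

```python
# Map the table's charset labels to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return raw.decode(codec), bits, raw.hex()


def convert(text):
    """Encode the text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexa) in convert("領").items():
        print(label, shown, bits, hexa)
```

For the single character 領, this sketch yields `97cc` for SJIS-WIN, `cece` for EUC-JP, and `e9a098` for UTF-8, which agrees with the byte sequences for that character in the table.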

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).