To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????喩??蘂??源?艾??竊?? 00111111001111110011111100111111001111110011111110011010011001110011111100111111111001010100000100111111001111111000110010111001001111111110010010001000001111110011111111100010100001100011111100111111 3f3f3f3f3f3f9a673f3fe5413f3f8cb93fe4883f3fe2863f3f
EUC-JP ???馹??喩??蘂??源?艾??竊?? 001111110011111100111111100011111110100110100001001111110011111111010011110010000011111100111111111010011010001000111111001111111011100010111011001111111110011111101000001111110011111111100011111001100011111100111111 3f3f3f8fe9a13f3fd3c83f3fe9a23f3fb8bb3fe7e83f3fe3e63f3f
UTF-8 列룸쑙馹쇿뿽喩뽳폇蘂뚢븠源텽艾쎈돃竊뺠틡 111011111010011010011100111010111010001110111000111011001001000110011001111010011010011010111001111011001000011110111111111010111011111110111101111001011001011010101001111010111011110110110011111011011000111110000111111010001001100010000010111010111001101010100010111010111011100010100000111001101011101010010000111011011000010110111101111010001000100110111110111011001000111010001000111010111000111110000011111001111010101110001010111010111011101010100000111011011000101110100001 efa69ceba3b8ec9199e9a6b9ec87bfebbfbde596a9ebbdb3ed8f87e89882eb9aa2ebb8a0e6ba90ed85bde889beec8e88eb8f83e7ab8aebbaa0ed8ba1
UHC 列룸쑙馹쇿뿽喩뽳폇蘂뚢븠源텽艾쎈돃竊뺠틡 11100110111010101011011111101011100111001011100011101100111100011001100111100101100101111011110111101010111001111001011011101111101111001001010011100111110111101000110011100010100101011000100111101010101110011011011101001111111001001111010110111101111010111000100110010110111011111011110010010101111010001011101010001101 e6eab7eb9cb8ecf199e597bdeae796efbc94e7de8ce29589eab9b74fe4f5bdeb8996efbc95e8ba8d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)