To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????c[?????????c[^ 0011111100111111001111110011111100111111001111110011111100111111001111110110001101011011001111110011111100111111001111110011111100111111001111110011111100111111011000110101101101011110 3f3f3f3f3f3f3f3f3f635b3f3f3f3f3f3f3f3f3f635b5e
SJIS-WIN ?????????c[?????????c[^ 0011111100111111001111110011111100111111001111110011111100111111001111110110001101011011001111110011111100111111001111110011111100111111001111110011111100111111011000110101101101011110 3f3f3f3f3f3f3f3f3f635b3f3f3f3f3f3f3f3f3f635b5e
EUC-JP ?????????c[?????????c[^ 0011111100111111001111110011111100111111001111110011111100111111001111110110001101011011001111110011111100111111001111110011111100111111001111110011111100111111011000110101101101011110 3f3f3f3f3f3f3f3f3f635b3f3f3f3f3f3f3f3f3f635b5e
UTF-8 렻ㅔ셔┙팍ㅔ셔ㅔ렍c[렻ㅔ셔┙팍ㅔ셔ㅔ렍c[^ 1110101110100000101110111110001110000101100101001110110010000101100101001110001010010100100110011110110110001100100011011110001110000101100101001110110010000101100101001110001110000101100101001110101110100000100011010110001101011011111010111010000010111011111000111000010110010100111011001000010110010100111000101001010010011001111011011000110010001101111000111000010110010100111011001000010110010100111000111000010110010100111010111010000010001101011000110101101101011110 eba0bbe38594ec8594e29499ed8c8de38594ec8594e38594eba08d635beba0bbe38594ec8594e29499ed8c8de38594ec8594e38594eba08d635b5e
UHC 렻ㅔ셔┙팍ㅔ셔ㅔ렍c[렻ㅔ셔┙팍ㅔ셔ㅔ렍c[^ 1000111011000011101001001100010010111100110001011010011011000100110001101100010110100100110001001011110011000101101001001100010010001110101000110110001101011011100011101100001110100100110001001011110011000101101001101100010011000110110001011010010011000100101111001100010110100100110001001000111010100011011000110101101101011110 8ec3a4c4bcc5a6c4c6c5a4c4bcc5a4c48ea3635b8ec3a4c4bcc5a6c4c6c5a4c4bcc5a4c48ea3635b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)