To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ???陰??儒??[???陰??儒??[^ 00111111001111110011111110001001010000010011111100111111100011101111001000111111001111110101101100111111001111110011111110001001010000010011111100111111100011101111001000111111001111110101101101011110 3f3f3f89413f3f8ef23f3f5b3f3f3f89413f3f8ef23f3f5b5e
EUC-JP ???陰??儒??[???陰??儒??[^ 00111111001111110011111110110001101000100011111100111111101111001111010000111111001111110101101100111111001111110011111110110001101000100011111100111111101111001111010000111111001111110101101101011110 3f3f3fb1a23f3fbcf43f3f5b3f3f3fb1a23f3fbcf43f3f5b5e
UTF-8 閭잙뀞陰룡췃儒뺤넽[閭잙뀞陰룡췃儒뺤넽[^ 111011111010011010000110111011001001111010011001111010111000000010011110111010011001100110110000111010111010001110100001111011001011011110000011111001011000010010010010111010111011101010100100111010111000010010111101010110111110111110100110100001101110110010011110100110011110101110000000100111101110100110011001101100001110101110100011101000011110110010110111100000111110010110000100100100101110101110111010101001001110101110000100101111010101101101011110 efa686ec9e99eb809ee999b0eba3a1ecb783e58492ebbaa4eb84bd5befa686ec9e99eb809ee999b0eba3a1ecb783e58492ebbaa4eb84bd5b5e
UHC 閭잙뀞陰룡췃儒뺤넽[閭잙뀞陰룡췃儒뺤넽[^ 111001101010110110011111111010111000010110010101111010111110010010110111111001101010110110011111111010101110001110010101111011001000011010110111010110111110011010101101100111111110101110000101100101011110101111100100101101111110011010101101100111111110101011100011100101011110110010000110101101110101101101011110 e6ad9feb8595ebe4b7e6ad9feae395ec86b75be6ad9feb8595ebe4b7e6ad9feae395ec86b75b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)