What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 壤?????楡??[壤?????楡??[^ 10011010110111110011111100111111001111110011111100111111100111101011111000111111001111110101101110011010110111110011111100111111001111110011111100111111100111101011111000111111001111110101101101011110 9adf3f3f3f3f3f9ebe3f3f5b9adf3f3f3f3f3f9ebe3f3f5b5e
EUC-JP 壤?????楡??[壤?????楡??[^ 11010100111000010011111100111111001111110011111100111111110111001100000000111111001111110101101111010100111000010011111100111111001111110011111100111111110111001100000000111111001111110101101101011110 d4e13f3f3f3f3fdcc03f3f5bd4e13f3f3f3f3fdcc03f3f5b5e
UTF-8 壤쀫슩六쇔퐮楡뀀뻤[壤쀫슩六쇔퐮楡뀀뻤[^ 111001011010001110100100111011001000000010101011111011001000101010101001111011111010011110010001111011001000011110010100111011011001000010101110111001101010010110100001111010111000000010000000111010111011101110100100010110111110010110100011101001001110110010000000101010111110110010001010101010011110111110100111100100011110110010000111100101001110110110010000101011101110011010100101101000011110101110000000100000001110101110111011101001000101101101011110 e5a3a4ec80abec8aa9efa791ec8794ed90aee6a5a1eb8080ebbba45be5a3a4ec80abec8aa9efa791ec8794ed90aee6a5a1eb8080ebbba45b5e
UHC 壤쀫슩六쇔퐮楡뀀뻤[壤쀫슩六쇔퐮楡뀀뻤[^ 111001011011110110010111111010111001101010110010111010111011101110111100111001011011110110010111111010101111100010110010111010111011101110111100010110111110010110111101100101111110101110011010101100101110101110111011101111001110010110111101100101111110101011111000101100101110101110111011101111000101101101011110 e5bd97eb9ab2ebbbbce5bd97eaf8b2ebbbbc5be5bd97eb9ab2ebbbbce5bd97eaf8b2ebbbbc5b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
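
The conversion in the table can be approximated offline with Python's built-in codecs. This is a minimal sketch, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the runs of 3f in the ISO-8859-1, SJIS-WIN and EUC-JP rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, as noted above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("壤[^").items():
        print(label, bits, hexa)
```

For example, `encode_row("[^", "utf-8")` returns `("0101101101011110", "5b5e")`, which matches the final two bytes of every row in the table.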