To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ?遐荊????け?[?遐荊????け?[^ 001111111110011110100000100011000111010000111111001111110011111100111111100000101010111100111111010110110011111111100111101000001000110001110100001111110011111100111111001111111000001010101111001111110101101101011110 3fe7a08c743f3f3f3f82af3f5b3fe7a08c743f3f3f3f82af3f5b5e
EUC-JP ?遐荊????け?[?遐荊????け?[^ 001111111110111010100010101101111101010100111111001111110011111100111111101001001011000100111111010110110011111111101110101000101011011111010101001111110011111100111111001111111010010010110001001111110101101101011110 3feea2b7d53f3f3f3fa4b13f5b3feea2b7d53f3f3f3fa4b13f5b5e
UTF-8 뤋遐荊콒쫸샘폼け렑[뤋遐荊콒쫸샘폼け렑[^ 111010111010010010001011111010011000000110010000111010001000110110001010111011001011110110010010111011001010101110111000111011001000001110011000111011011000111110111100111000111000000110010001111010111010000010010001010110111110101110100100100010111110100110000001100100001110100010001101100010101110110010111101100100101110110010101011101110001110110010000011100110001110110110001111101111001110001110000001100100011110101110100000100100010101101101011110 eba48be98190e88d8aecbd92ecabb8ec8398ed8fbce38191eba0915beba48be98190e88d8aecbd92ecabb8ec8398ed8fbce38191eba0915b5e
UHC 뤋遐荊콒쫸샘폼け렑[뤋遐荊콒쫸샘폼け렑[^ 100011111011101111111001110001101111101110101010101100011000111010100110100011111011101111111001110001101111101110101010101100011000111010100110010110111000111110111011111110011100011011111011101010101011000110001110101001101000111110111011111110011100011011111011101010101011000110001110101001100101101101011110 8fbbf9c6fbaab18ea68fbbf9c6fbaab18ea65b8fbbf9c6fbaab18ea68fbbf9c6fbaab18ea65b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)