To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???梧??要ょ?嚥ラ?哀??B 00111111001111110011111110001100111001100011111100111111100101110111011010000010111001010011111110011010100010111000001110001001001111111000100010100011001111110011111101000010 3f3f3f8ce63f3f977682e53f9a8b83893f88a33f3f42
EUC-JP 縕??梧??要ょ?嚥ラł哀??B 1000111111010100110000100011111100111111101110001110100000111111001111111100110111010111101001001110011100111111110100111110101110100101111010011000111110101001110010001011000010100101001111110011111101000010 8fd4c23f3fb8e83f3fcdd7a4e73fd3eba5e98fa9c8b0a53f3f42
UTF-8 縕뷂슉梧잞쉽要ょㅌ嚥ラł哀앭퍩B 111001111011100010010101111010111011011110000010111011001000101010001001111001101010001010100111111011001001111010011110111011001000100110111101111010001010011010000001111000111000001010000111111000111000010110001100111001011001101010100101111000111000001110101001110001011000001011100101100100111000000011101100100101011010110111101101100011011010100101000010 e7b895ebb782ec8a89e6a2a7ec9e9eec89bde8a681e38287e3858ce59aa5e383a9c582e59380ec95aded8da942
UHC 縕뷂슉梧잞쉽要ょㅌ嚥ラł哀앭퍩B 11101000101100101001010011101111101111011011010111100111111111001001111111101111101111011011000111101001101010011010101011100111101001001011110011100110101111111010101111101001101010011010100111100100111011101001110111100101101110111010000001000010 e8b294efbdb5e7fc9fefbdb1e9a9aae7a4bce6bfabe9a9a9e4ee9de5bba042

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
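
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code this page actually runs. It assumes that Python's built-in codecs `cp932` and `cp949` correspond to SJIS-Win and UHC, and that characters a charset cannot represent are replaced with "?" (0x3f), which is what the table appears to show. The function name `encode_table` and the sample input are hypothetical.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = CP949
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    # Hypothetical sample input: one hiragana letter followed by "B".
    for label, bits, hexa in encode_table("\u3042B"):
        print(label, bits, hexa)
```

Each byte is written as eight binary digits, so the binary column is always four times as long as the hexadecimal column.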