To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset  Character  Bit string (binary)  Bit string (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 汚??梧??要ょ?嚥ラ?哀??B 1000100110011000001111110011111110001100111001100011111100111111100101110111011010000010111001010011111110011010100010111000001110001001001111111000100010100011001111110011111101000010 89983f3f8ce63f3f977682e53f9a8b83893f88a33f3f42
EUC-JP 汚??梧??要ょ?嚥ラł哀??B 10110001111110000011111100111111101110001110100000111111001111111100110111010111101001001110011100111111110100111110101110100101111010011000111110101001110010001011000010100101001111110011111101000010 b1f83f3fb8e83f3fcdd7a4e73fd3eba5e98fa9c8b0a53f3f42
UTF-8 汚뉛쉬梧잞쉽要ょㅌ嚥ラł哀앭퍩B 111001101011000110011010111010111000100110011011111011001000100110101100111001101010001010100111111011001001111010011110111011001000100110111101111010001010011010000001111000111000001010000111111000111000010110001100111001011001101010100101111000111000001110101001110001011000001011100101100100111000000011101100100101011010110111101101100011011010100101000010 e6b19aeb899bec89ace6a2a7ec9e9eec89bde8a681e38287e3858ce59aa5e383a9c582e59380ec95aded8da942
UHC 汚뉛쉬梧잞쉽要ょㅌ嚥ラł哀앭퍩B 11100111111111011000011111101111101111011010110011100111111111001001111111101111101111011011000111101001101010011010101011100111101001001011110011100110101111111010101111101001101010011010100111100100111011101001110111100101101110111010000001000010 e7fd87efbdace7fc9fefbdb1e9a9aae7a4bce6bfabe9a9a9e4ee9de5bba042

SJIS-WIN, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f) in the table.
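
The conversion in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_bits` and `encode_all` are made up for illustration. Unencodable characters are replaced with "?" (byte 3f), which matches the behavior visible in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_all(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_bits(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexstr) in encode_all("汚B").items():
        print(name, binary, hexstr)
```

For the input "汚B", the UTF-8 row should come out as e6b19a42 and the ISO-8859-1 row as 3f42, since ISO-8859-1 has no code point for 汚.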