To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嚥??歪??畏??榮??節③?熬ラ?歪??^ 10011010100010110011111100111111100110000110001100111111001111111000100011011000001111110011111110011110110001000011111100111111100100001101111110000111010000100011111111100000100100101000001110001001001111111001100001100011001111110011111101011110 9a8b3f3f98633f3f88d83f3f9ec43f3f90df87423fe09283893f98633f3f5e
EUC-JP 嚥??歪?˙畏??榮??節??熬ラ?歪??^ 1101001111101011001111110011111111001111110001000011111110001111101000101011001010110000110110100011111100111111110111001100011000111111001111111100000011100001001111110011111111011111111100101010010111101001001111111100111111000100001111110011111101011110 d3eb3f3fcfc43f8fa2b2b0da3f3fdcc63f3fc0e13f3fdff2a5e93fcfc43f3f5e
UTF-8 嚥듸쉔歪곮˙畏븅굙榮띷영節③굙熬ラ댖歪곭킀^ 111001011001101010100101111010111001001110111000111011001000100110010100111001101010110110101010111010101011001110101110110010111001100111100111100101011000111111101011101110001000010111101010101101011001100111100110101001101010111011101011100111011011011111101100100110001000000111100111101011111000000011100010100100011010001011101010101101011001100111100111100001101010110011100011100000111010100111101011100011001001011011100110101011011010101011101010101100111010110111101101100000101000000001011110 e59aa5eb93b8ec8994e6adaaeab3aecb99e7958febb885eab599e6a6aeeb9db7ec9881e7af80e291a2eab599e786ace383a9eb8c96e6adaaeab3aded82805e
UHC 嚥듸쉔歪곮˙畏븅굙榮띷영節③굙熬ラ댖歪곭킀^ 11100110101111111011010111101111101111011010100011101000111000001000000111101000101000101010101111101000111001101011101011101001100000101000000111100111101101001000110111100110101111111011010111101111101111011010100011101001100000101000000111101000101000101010101111101001100010001011101011101000111000001000000111100111101101001000110101011110 e6bfb5efbda8e8e081e8a2abe8e6bae98281e7b48de6bfb5efbda8e98281e8a2abe988bae8e081e7b48d5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)