To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?脹??逕鈞?徹雰?脹??逕鈞?徹雰B 0011111110010010101011110011111100111111111001111001010011100111111000000011111110010011010011111001010110110101001111111001001010101111001111110011111111100111100101001110011111100000001111111001001101001111100101011011010101000010 3f92af3f3fe794e7e03f934f95b53f92af3f3fe794e7e03f934f95b542
EUC-JP ?脹??逕鈞?徹雰?脹??逕鈞?徹雰B 0011111111000100101100010011111100111111111011011111010011101110111000100011111111000101101100001100101010110111001111111100010010110001001111110011111111101101111101001110111011100010001111111100010110110000110010101011011101000010 3fc4b13f3fedf4eee23fc5b0cab73fc4b13f3fedf4eee23fc5b0cab742
UTF-8 뤋脹탲샅逕鈞뤋徹雰뤋脹탲샅逕鈞뤋徹雰B 11101011101001001000101111101000100001001011100111101101100000111011001011101100100000111000010111101001100000001001010111101001100010001001111011101011101001001000101111100101101111101011100111101001100110111011000011101011101001001000101111101000100001001011100111101101100000111011001011101100100000111000010111101001100000001001010111101001100010001001111011101011101001001000101111100101101111101011100111101001100110111011000001000010 eba48be884b9ed83b2ec8385e98095e9889eeba48be5beb9e99bb0eba48be884b9ed83b2ec8385e98095e9889eeba48be5beb9e99bb042
UHC 뤋脹탲샅逕鈞뤋徹雰뤋脹탲샅逕鈞뤋徹雰B 10001111101110111111001111101100101101011000111110111011111101001100110011101111110100001011011110001111101110111111010011001011110111011101010010001111101110111111001111101100101101011000111110111011111101001100110011101111110100001011011110001111101110111111010011001011110111011101010001000010 8fbbf3ecb58fbbf4ccefd0b78fbbf4cbddd48fbbf3ecb58fbbf4ccefd0b78fbbf4cbddd442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)