What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??裕??矣??繹??幼??純??繹??毅 111000011001111100111111001111111001011101010100001111110011111111100001111000010011111100111111111000111000100000111111001111111001011101100011001111110011111110001111100000110011111100111111111000111000100000111111001111111000101101000010 e19f3f3f97543f3fe1e13f3fe3883f3f97633f3f8f833f3fe3883f3f8b42
EUC-JP 癲??裕??矣??繹??幼??純??繹??毅 111000101010000100111111001111111100110110110101001111110011111111100010111000110011111100111111111001011110100000111111001111111100110111000100001111110011111110111101111000110011111100111111111001011110100000111111001111111011010110100011 e2a13f3fcdb53f3fe2e33f3fe5e83f3fcdc43f3fbde33f3fe5e83f3fb5a3
UTF-8 癲숈슦裕뗦룚矣섏뜪繹먮씮幼뗤튋純껉석繹먭쐴毅 111001111001100110110010111011001000100010001000111011001000101010100110111010001010001110010101111010111001011110100110111010111010001110011010111001111001111110100011111011001000010010001111111010111001110010101010111001111011100110111001111010111010100010101110111011001001010010101110111001011011100110111100111010111001011110100100111011011000101010001011111001111011010010010100111010101011101110001001111011001000010010011101111001111011100110111001111010111010100010101101111011001001000010110100111001101010111110000101 e799b2ec8888ec8aa6e8a395eb97a6eba39ae79fa3ec848feb9caae7b9b9eba8aeec94aee5b9bceb97a4ed8a8be7b494eabb89ec849de7b9b9eba8adec90b4e6af85
UHC 癲숈슦裕뗦룚矣섏뜪繹먮씮幼뗤튋純껉석繹먭쐴毅 1110111110100110100110011110110010011010101100001110101110101110100010111110011010001111100101101110101111111000100110001110110010001101101010111110011010111010100100001110101110011101101111111110101011101010100010111110010010111001100111111110001011101101100000111110101010111100101011101110011010111010100100001110101010111110101000011110101111110110 efa699ec9ab0ebae8be68f96ebf898ec8dabe6ba90eb9dbfeaea8be4b99fe2ed83eabcaee6ba90eabea1ebf6

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
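
A table like the one above can be reproduced offline with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation: the function name `encode_table` and the mapping from charset labels to Python codec names are assumptions (in particular, `cp932` is used for SJIS-Win and `cp949` for UHC), and characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the converter does.

```python
# Hypothetical mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ"):
        print(label, bits, hexstr)
```

Because each byte is formatted as eight binary digits, the binary column is always four times as long as the hexadecimal column, as in the table.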