What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??裕??松??儼??泣??應??艾??魏 111000011001111100111111001111111001011101010100001111110011111110001111101111000011111100111111100110010101011000111111001111111000101110000011001111110011111110011100111001000011111100111111111001001000100000111111001111111110100110110000 e19f3f3f97543f3f8fbc3f3f99563f3f8b833f3f9ce43f3fe4883f3fe9b0
EUC-JP 癲??裕??松??儼??泣??應??艾??魏 111000101010000100111111001111111100110110110101001111110011111110111110101111100011111100111111110100011011011100111111001111111011010111100011001111110011111111011000111001100011111100111111111001111110100000111111001111111111001010110010 e2a13f3fcdb53f3fbebe3f3fd1b73f3fb5e33f3fd8e63f3fe7e83f3ff2b2
UTF-8 癲븍쵉裕끻섧松쏀뫔儼볥톪泣됬넲應쎈렱艾싲떼魏 111001111001100110110010111010111011100010001101111011001011010110001001111010001010001110010101111010111000000110111011111011001000010010100111111001101001110110111110111011001000111110000000111010111010101110010100111001011000010010111100111010111011001110100101111011011000011010101010111001101011001110100011111010111001000010101100111010111000010010110010111001101000011110001001111011001000111010001000111010111010000010110001111010001000100110111110111011001000101110110010111010111001011010111100111010011010110110001111 e799b2ebb88decb589e8a395eb81bbec84a7e69dbeec8f80ebab94e584bcebb3a5ed86aae6b3a3eb90aceb84b2e68789ec8e88eba0b1e889beec8bb2eb96bce9ad8f
UHC 癲븍쵉裕끻섧松쏀뫔儼볥톪泣됬넲應쎈렱艾싲떼魏 1110111110100110101110101110101110101100100010111110101110101110100001011110010110111100101101011110000111100110101111011110110110010001101101101110010111110000100100111110101110110111100000101110101111101000100010011110011110000110101100011110101111101011101111011110101110001110101111101110010011110101100110101110101110110110101111001110101011100000 efa6baebac8bebae85e5bcb5e1e6bded91b6e5f093ebb782ebe889e786b1ebebbdeb8ebee4f59aebb6bceae0

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
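
The same kind of conversion can be sketched offline in Python. This is only an illustrative approximation, not the code behind this page: the helper name `encode_bits` and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumptions, and characters that a charset cannot represent are replaced with `?` (0x3f), which may differ from how this tool handles them.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "あ"  # hypothetical input; any short string works
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_bits(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For example, `encode_bits("あ", "utf-8")` yields the hexadecimal string `e38182`, while `encode_bits("あ", "iso-8859-1")` yields `3f` because ISO-8859-1 has no hiragana.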