To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 巽短卒誰遜促巽存族奪損多巽足袖辿息 10010010010001101001001001011010100100011011001010010010010011101001000110111011100100011010001110010010010001101001000110110110100100011011000010010010010001001001000110111001100100011011110110010010010001101001000110101011100100011011001110010010010010001001000110100111 9246925a91b2924e91bb91a3924691b691b0924491b991bd924691ab91b3924891a7
EUC-JP 巽短卒誰遜促巽存族奪損多巽足袖辿息 11000011101001111100001110111011110000101011010011000011101011111100001010111101110000101010010111000011101001111100001010111000110000101011001011000011101001011100001010111011110000101011111111000011101001111100001010101101110000101011010111000011101010011100001010101001 c3a7c3bbc2b4c3afc2bdc2a5c3a7c2b8c2b2c3a5c2bbc2bfc3a7c2adc2b5c3a9c2a9
UTF-8 巽短卒誰遜促巽存族奪損多巽足袖辿息 111001011011011110111101111001111001111110101101111001011000110110010010111010001010101010110000111010011000000110011100111001001011111110000011111001011011011110111101111001011010110110011000111001101001011110001111111001011010010110101010111001101001000010001101111001011010010010011010111001011011011110111101111010001011011010110011111010001010001010010110111010001011111010111111111001101000000110101111 e5b7bde79fade58d92e8aab0e9819ce4bf83e5b7bde5ad98e6978fe5a5aae6908de5a49ae5b7bde8b6b3e8a296e8bebfe681af
UHC 巽短卒誰遜促巽存族奪損多巽足袖?息 111000011101111011010011101011011111000011101111111000101100000111100001111000011111010110110101111000011101111011110000111011011111000011101001111101111010110011100001110111111101001011111101111000011101111011110000111010111110001011000000001111111110001111010011 e1ded3adf0efe2c1e1e1f5b5e1def0edf0e9f7ace1dfd2fde1def0ebe2c03fe3d3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)