What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 雅??鼇??億?????壹??泳??鵝??? 10001001111010110011111100111111111010101000011100111111001111111000100110101101001111110011111100111111001111110011111110011010111000110011111100111111100010010110101000111111001111111110101001000000001111110011111100111111 89eb3f3fea873f3f89ad3f3f3f3f3f9ae33f3f896a3f3fea403f3f3f
EUC-JP 雅??鼇??億?????壹??泳??鵝??? 10110010111011010011111100111111111100111110011100111111001111111011001010101111001111110011111100111111001111110011111111010100111001010011111100111111101100011100101100111111001111111111001110100001001111110011111100111111 b2ed3f3ff3e73f3fb2af3f3f3f3f3fd4e53f3fb1cb3f3ff3a13f3f3f
UTF-8 雅먮젙鼇귣젲億됰툍杻듣튋壹쒕젻泳볢듂鵝롥굄溜 111010011001101110000101111010111010100010101110111011001010000010011001111010011011110010000111111010101011011110100011111011001010000010110010111001011000010010000100111010111001000010110000111011011000100010001101111011111010011110001000111010111001001110100011111011011000101010001011111001011010001110111001111011001001001010010101111011001010000010111011111001101011001110110011111010111011001110100010111010111001001110000010111010011011010110011101111010111010000110100101111010101011010110000100111011111010011110001011 e99b85eba8aeeca099e9bc87eab7a3eca0b2e58484eb90b0ed888defa788eb93a3ed8a8be5a3b9ec9295eca0bbe6b3b3ebb3a2eb9382e9b59deba1a5eab584efa78b
UHC 雅먮젙鼇귣젲億됰툍杻듣튋壹쒕젻泳볢듂鵝롥굄溜 1110010010111010100100001110101110100000100101011110100010101000100000101110101110100000101001101110010111100010100010011110101110111000100001011110101011110100101101011110100010111001100111111110110011101100100111001110101110100000101011101110011110110110100100111110100010001010101101111110010010111101100011101110010110110001101011111110101011111110 e4ba90eba095e8a882eba0a6e5e289ebb885eaf4b5e8b99fecec9ceba0aee7b693e88ab7e4bd8ee5b1afeafe

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
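
The kind of conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The function name `encode_report` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return {charset label: (binary string, hex string)} for text."""
    report = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        report[label] = (bits, raw.hex())
    return report


if __name__ == "__main__":
    for label, (bits, hex_digits) in encode_report("雅").items():
        print(label, bits, hex_digits)
```

For the single character 雅, the UTF-8 entry comes out as e99b85, which matches the first three bytes of the UTF-8 row in the table.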