What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????Uh??????U 001111110011111100111111001111110011111100111111010101010110100000111111001111110011111100111111001111110011111101010101 3f3f3f3f3f3f55683f3f3f3f3f3f55
SJIS-WIN 辰袖存誰遜揃Uh辰袖存誰遜揃U 100100100100001110010001101100111001000110110110100100100100111010010001101110111001000110110101010101010110100010010010010000111001000110110011100100011011011010010010010011101001000110111011100100011011010101010101 924391b391b6924e91bb91b55568924391b391b6924e91bb91b555
EUC-JP 辰袖存誰遜揃Uh辰袖存誰遜揃U 110000111010010011000010101101011100001010111000110000111010111111000010101111011100001010110111010101010110100011000011101001001100001010110101110000101011100011000011101011111100001010111101110000101011011101010101 c3a4c2b5c2b8c3afc2bdc2b75568c3a4c2b5c2b8c3afc2bdc2b755
UTF-8 辰袖存誰遜揃Uh辰袖存誰遜揃U 111010001011111010110000111010001010001010010110111001011010110110011000111010001010101010110000111010011000000110011100111001101000111110000011010101010110100011101000101111101011000011101000101000101001011011100101101011011001100011101000101010101011000011101001100000011001110011100110100011111000001101010101 e8beb0e8a296e5ad98e8aab0e9819ce68f835568e8beb0e8a296e5ad98e8aab0e9819ce68f8355
UHC 辰袖存誰遜?Uh辰袖存誰遜?U 11110010111000111110001011000000111100001110110111100010110000011110000111100001001111110101010101101000111100101110001111100010110000001111000011101101111000101100000111100001111000010011111101010101 f2e3e2c0f0ede2c1e1e13f5568f2e3e2c0f0ede2c1e1e13f55
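
The table rows above can be reproduced approximately with a short Python sketch. It assumes that Python's `cp932` codec corresponds to SJIS-WIN and `cp949` to UHC; the helper name `encode_report` is hypothetical, and the mapping tables of Python's codecs may differ from those of the converter for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the ISO-8859-1 row.

```python
# Minimal sketch: encode a string in several charsets and show the
# resulting bytes as a binary string and a hexadecimal string.
# Codec names are Python's; their equivalence to the charsets in the
# table (cp932 = SJIS-WIN, cp949 = UHC) is an assumption.

CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_report(text, charsets=CHARSETS):
    """Return a list of (charset, binary string, hex string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes b"?" for unencodable characters
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hex_string in encode_report("辰Uh"):
        print(name, bits, hex_string)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.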

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.