To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????w??????k 0011111100111111001111110011111100111111001111110111011100111111001111110011111100111111001111110011111101101011 3f3f3f3f3f3f773f3f3f3f3f3f6b
SJIS-WIN 辰袖存誰遜卒w辰袖存誰遜促k 1001001001000011100100011011001110010001101101101001001001001110100100011011101110010001101100100111011110010010010000111001000110110011100100011011011010010010010011101001000110111011100100011010001101101011 924391b391b6924e91bb91b277924391b391b6924e91bb91a36b
EUC-JP 辰袖存誰遜卒w辰袖存誰遜促k 1100001110100100110000101011010111000010101110001100001110101111110000101011110111000010101101000111011111000011101001001100001010110101110000101011100011000011101011111100001010111101110000101010010101101011 c3a4c2b5c2b8c3afc2bdc2b477c3a4c2b5c2b8c3afc2bdc2a56b
UTF-8 辰袖存誰遜卒w辰袖存誰遜促k 1110100010111110101100001110100010100010100101101110010110101101100110001110100010101010101100001110100110000001100111001110010110001101100100100111011111101000101111101011000011101000101000101001011011100101101011011001100011101000101010101011000011101001100000011001110011100100101111111000001101101011 e8beb0e8a296e5ad98e8aab0e9819ce58d9277e8beb0e8a296e5ad98e8aab0e9819ce4bf836b
UHC 辰袖存誰遜卒w辰袖存誰遜促k 1111001011100011111000101100000011110000111011011110001011000001111000011110000111110000111011110111011111110010111000111110001011000000111100001110110111100010110000011110000111100001111101011011010101101011 f2e3e2c0f0ede2c1e1e1f0ef77f2e3e2c0f0ede2c1e1e1f5b56b

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)