To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 顆懷雀蔗ア隲キ頡、髯應鄧蜚ア隲キ遐ソ^ 11101000111101111001110011100101100100001001110111100100111100101011000111101000101010111011011111101000111101001010010011101001100110011001110011100100111110111011100111100101100101001011000111101000101010111011011111100111101000001011111101011110 e8f79ce5909de4f2b1e8abb7e8f4a4e9999ce4fbb9e594b1e8abb7e7a0bf5e
EUC-JP 顆懷雀蔗ア隲キ頡、髯應鄧蜚ア隲キ遐ソ^ 1111000011111001110110001110011110111111111111011110100011110100100011101011000111110000101011011000111010110111111100001111011010001110101001001111000111111001110110001110011010001111111000101100011111101001111101001000111010110001111100001010110110001110101101111110111010100010100011101011111101011110 f0f9d8e7bffde8f48eb1f0ad8eb7f0f68ea4f1f9d8e68fe2c7e9f48eb1f0ad8eb7eea28ebf5e
UTF-8 顆懷雀蔗ア隲キ頡、髯應鄧蜚ア隲キ遐ソ^ 11101001101000011000011011100110100001111011011111101001100110111000000011101000100101001001011111101111101111011011000111101001100110101011001011101111101111011011011111101001101000001010000111101111101111011010010011101001101010111010111111100110100001111000100111101001100001001010011111101000100111001001101011101111101111011011000111101001100110101011001011101111101111011011011111101001100000011001000011101111101111011011111101011110 e9a186e687b7e99b80e89497efbdb1e99ab2efbdb7e9a0a1efbda4e9abafe68789e984a7e89c9aefbdb1e99ab2efbdb7e98190efbdbf5e
UHC 顆懷雀蔗??????應鄧蜚???遐?^ 110011101010100011111100111000111110110111001101111011011011110100111111001111110011111100111111001111110011111111101011111010111101010011111000110111101010010000111111001111110011111111111001110001100011111101011110 cea8fce3edcdedbd3f3f3f3f3f3febebd4f8dea43f3f3ff9c63f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)