To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯??????揖???l?鶯??????陰 1110100111110010001111110011111100111111001111110011111100111111100101110100101100111111001111110011111110000010100011000011111111101001111100100011111100111111001111110011111100111111001111111000100101000001 e9f23f3f3f3f3f3f974b3f3f3f828c3fe9f23f3f3f3f3f3f8941
EUC-JP 鶯???獒??揖??洹l?鶯???獒??陰 1111001011110100001111110011111100111111100011111100101110111011001111110011111111001101101011000011111100111111100011111100011110111010101000111110110000111111111100101111010000111111001111110011111110001111110010111011101100111111001111111011000110100010 f2f43f3f3f8fcbbb3f3fcdac3f3f8fc7baa3ec3ff2f43f3f3f8fcbbb3f3fb1a2
UTF-8 鶯ㅺ퉮횞獒뺣뛼揖썽넫洹l꽫鶯ㅺ퉮횞獒뺣뛾陰 111010011011011010101111111000111000010110111010111011011000100110101110111011011001101010011110111001111000110110010010111010111011101010100011111010111001101110111100111001101000111110010110111011001000110110111101111010111000010010101011111001101011010010111001111011111011110110001100111010101011110110101011111010011011011010101111111000111000010110111010111011011000100110101110111011011001101010011110111001111000110110010010111010111011101010100011111010111001101110111110111010011001100110110000 e9b6afe385baed89aeed9a9ee78d92ebbaa3eb9bbce68f96ec8dbdeb84abe6b4b9efbd8ceabdabe9b6afe385baed89aeed9a9ee78d92ebbaa3eb9bbee999b0
UHC 鶯ㅺ퉮횞獒뺣뛼揖썽넫洹l꽫鶯ㅺ퉮횞獒뺣뛾陰 111001011010001110100100111010101011100110000110110000111001011111101000101000111001010111101011100011011000001011101011111001111011110111101001100001101010101111101010101101111010001111101100100001001011011011100101101000111010010011101010101110011000011011000011100101111110100010100011100101011110101110001101100001001110101111100100 e5a3a4eab986c397e8a395eb8d82ebe7bde986abeab7a3ec84b6e5a3a4eab986c397e8a395eb8d84ebe4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)