To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ð¹Þä°­î´Ý¼ð¹Þä°­î´Ý¼^ 111100001011100111011110111001001011000010101101111011101011010011011101101111001111000010111001110111101110010010110000101011011110111010110100110111011011110001011110 f0b9dee4b0adeeb4ddbcf0b9dee4b0adeeb4ddbc5e
SJIS-WIN ????°??´??????°??´??^ 00111111001111110011111100111111100000011000101100111111001111111000000101001100001111110011111100111111001111110011111100111111100000011000101100111111001111111000000101001100001111110011111101011110 3f3f3f3f818b3f3f814c3f3f3f3f3f3f818b3f3f814c3f3f5e
EUC-JP ð?Þä°?î´Ý?ð?Þä°?î´Ý?^ 100011111010100111000011001111111000111110101001101100001000111110101011101000111010000111101011001111111000111110101011110000101010000110101101100011111010101011110010001111111000111110101001110000110011111110001111101010011011000010001111101010111010001110100001111010110011111110001111101010111100001010100001101011011000111110101010111100100011111101011110 8fa9c33f8fa9b08faba3a1eb3f8fabc2a1ad8faaf23f8fa9c33f8fa9b08faba3a1eb3f8fabc2a1ad8faaf23f5e
UTF-8 ð¹Þä°­î´Ý¼ð¹Þä°­î´Ý¼^ 1100001110110000110000101011100111000011100111101100001110100100110000101011000011000010101011011100001110101110110000101011010011000011100111011100001010111100110000111011000011000010101110011100001110011110110000111010010011000010101100001100001010101101110000111010111011000010101101001100001110011101110000101011110001011110 c3b0c2b9c39ec3a4c2b0c2adc3aec2b4c39dc2bcc3b0c2b9c39ec3a4c2b0c2adc3aec2b4c39dc2bc5e
UHC ð¹Þ?°­?´?¼ð¹Þ?°­?´?¼^ 1010100110100011101010011111011010101000101011010011111110100001110001101010000110101001001111111010001010100101001111111010100011111001101010011010001110101001111101101010100010101101001111111010000111000110101000011010100100111111101000101010010100111111101010001111100101011110 a9a3a9f6a8ad3fa1c6a1a93fa2a53fa8f9a9a3a9f6a8ad3fa1c6a1a93fa2a53fa8f95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)