To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ????????愛??????????愛??^ 00111111001111110011111100111111001111110011111100111111001111111000100010100100001111110011111100111111001111110011111100111111001111110011111100111111001111111000100010100100001111110011111101011110 3f3f3f3f3f3f3f3f88a43f3f3f3f3f3f3f3f3f3f88a43f3f5e
EUC-JP ????????愛??????????愛??^ 00111111001111110011111100111111001111110011111100111111001111111011000010100110001111110011111100111111001111110011111100111111001111110011111100111111001111111011000010100110001111110011111101011110 3f3f3f3f3f3f3f3fb0a63f3f3f3f3f3f3f3f3f3fb0a63f3f5e
UTF-8 렱렚섕렱렖셰렱렚愛서섕렱렚섕렱렖셰렱렚愛서섕^ 11101011101000001011000111101011101000001001101011101100100001001001010111101011101000001011000111101011101000001001011011101100100001011011000011101011101000001011000111101011101000001001101011100110100001001001101111101100100001001001110011101100100001001001010111101011101000001011000111101011101000001001101011101100100001001001010111101011101000001011000111101011101000001001011011101100100001011011000011101011101000001011000111101011101000001001101011100110100001001001101111101100100001001001110011101100100001001001010101011110 eba0b1eba09aec8495eba0b1eba096ec85b0eba0b1eba09ae6849bec849cec8495eba0b1eba09aec8495eba0b1eba096ec85b0eba0b1eba09ae6849bec849cec84955e
UHC 렱렚섕렱렖셰렱렚愛서섕렱렚섕렱렖셰렱렚愛서섕^ 100011101011111010001110101011011011110010101100100011101011111010001110101010111011110011001110100011101011111010001110101011011110010011110001101111001010110110111100101011001000111010111110100011101010110110111100101011001000111010111110100011101010101110111100110011101000111010111110100011101010110111100100111100011011110010101101101111001010110001011110 8ebe8eadbcac8ebe8eabbcce8ebe8eade4f1bcadbcac8ebe8eadbcac8ebe8eabbcce8ebe8eade4f1bcadbcac5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)