To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??誼??怨??鴉??鍮???l??レ? 11100001100111110011111100111111100010110110001000111111001111111000100110000101001111110011111111101001111010110011111100111111111010000100101000111111001111110011111110000010100011000011111100111111100000111000110000111111 e19f3f3f8b623f3f89853f3fe9eb3f3fe84a3f3f3f828c3f3f838c3f
EUC-JP 癲??誼??怨??鴉??鍮???l??レ? 11100010101000010011111100111111101101011100001100111111001111111011000111100101001111110011111111110010111011010011111100111111111011111010101100111111001111110011111110100011111011000011111100111111101001011110110000111111 e2a13f3fb5c33f3fb1e53f3ff2ed3f3fefab3f3f3fa3ec3f3fa5ec3f
UTF-8 癲좊끏誼㏆쭔怨좉틚鴉딆늿鍮당랜類l젋曆レ룤 111001111001100110110010111011001010001010001010111010111000000110001111111010001010101010111100111000111000111110000110111011001010110110010100111001101000000010101000111011001010001010001001111011011000101110011010111010011011010010001001111010111001010010000110111010111000101010111111111010011000110110101110111010111000101110111001111010111001111010011100111011111010011110010000111011111011110110001100111011001010000010001011111011111010011010001011111000111000001110101100111010111010001110100100 e799b2eca28aeb818fe8aabce38f86ecad94e680a8eca289ed8b9ae9b489eb9486eb8abfe98daeeb8bb9eb9e9cefa790efbd8ceca08befa68be383aceba3a4
UHC 癲좊끏誼㏆쭔怨좉틚鴉딆늿鍮당랜類l젋曆レ룤 111011111010011010100000111010111000010110111111111010111111111010100111111011111010011110001100111010101011001110100000111010101011101010000111111001001011110010001010111011001000100010001000111010111011100110110100111001111011011110100011111010111011101010100011111011001010000010001100111001101011011110101011111011001000111110011101 efa6a0eb85bfebfea7efa78ceab3a0eaba87e4bc8aec8888ebb9b4e7b7a3ebbaa3eca08ce6b7abec8f9d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)