To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 鞫ッ逍セ遐・鬥エn}鞫ッ逍セ遐・鬥エn{^ 1110011010010001101011111110011110010110101111101110011110100000101001011110100110100110101101000110111001111101111001101001000110101111111001111001011010111110111001111010000010100101111010011010011010110100011011100111101101011110 e691afe796bee7a0a5e9a6b46e7de691afe796bee7a0a5e9a6b46e7b5e
EUC-JP 鞫ッ逍セ遐・鬥エn}鞫ッ逍セ遐・鬥エn{^ 11101011111100011000111010101111111011011111011010001110101111101110111010100010100011101010010111110010101010001000111010110100011011100111110111101011111100011000111010101111111011011111011010001110101111101110111010100010100011101010010111110010101010001000111010110100011011100111101101011110 ebf18eafedf68ebeeea28ea5f2a88eb46e7debf18eafedf68ebeeea28ea5f2a88eb46e7b5e
UTF-8 鞫ッ逍セ遐・鬥エn}鞫ッ逍セ遐・鬥エn{^ 1110100110011110101010111110111110111101101011111110100110000000100011011110111110111101101111101110100110000001100100001110111110111101101001011110100110101100101001011110111110111101101101000110111001111101111010011001111010101011111011111011110110101111111010011000000010001101111011111011110110111110111010011000000110010000111011111011110110100101111010011010110010100101111011111011110110110100011011100111101101011110 e99eabefbdafe9808defbdbee98190efbda5e9aca5efbdb46e7de99eabefbdafe9808defbdbee98190efbda5e9aca5efbdb46e7b5e
UHC 鞫?逍?遐???n}鞫?逍?遐???n{^ 110011111101010000111111111000011100111000111111111110011100011000111111001111110011111101101110011111011100111111010100001111111110000111001110001111111111100111000110001111110011111100111111011011100111101101011110 cfd43fe1ce3ff9c63f3f3f6e7dcfd43fe1ce3ff9c63f3f3f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)