To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????U?????????UB 001111110011111100111111001111110011111100111111001111110011111100111111010101010011111100111111001111110011111100111111001111110011111100111111001111110101010101000010 3f3f3f3f3f3f3f3f3f553f3f3f3f3f3f3f3f3f5542
SJIS-WIN 菴??泣?┸濡??U菴??泣?┸濡??UB 1110010010111101001111110011111110001011100000110011111110000100101111011001010001000111001111110011111101010101111001001011110100111111001111111000101110000011001111111000010010111101100101000100011100111111001111110101010101000010 e4bd3f3f8b833f84bd94473f3f55e4bd3f3f8b833f84bd94473f3f5542
EUC-JP 菴??泣?┸濡??U菴??泣?┸濡??UB 1110100010111111001111110011111110110101111000110011111110101000101111111100011110101000001111110011111101010101111010001011111100111111001111111011010111100011001111111010100010111111110001111010100000111111001111110101010101000010 e8bf3f3fb5e33fa8bfc7a83f3f55e8bf3f3fb5e33fa8bfc7a83f3f5542
UTF-8 菴뀀뵃泣덌┸濡녹돩U菴뀀뵃泣덌┸濡녹돩UB 111010001000111110110100111010111000000010000000111010111011010110000011111001101011001110100011111010111000110110001100111000101001010010111000111001101011111110100001111010111000010110111001111010111000111110101001010101011110100010001111101101001110101110000000100000001110101110110101100000111110011010110011101000111110101110001101100011001110001010010100101110001110011010111111101000011110101110000101101110011110101110001111101010010101010101000010 e88fb4eb8080ebb583e6b3a3eb8d8ce294b8e6bfa1eb85b9eb8fa955e88fb4eb8080ebb583e6b3a3eb8d8ce294b8e6bfa1eb85b9eb8fa95542
UHC 菴뀀뵃泣덌┸濡녹돩U菴뀀뵃泣덌┸濡녹돩UB 111001001110000010110010111010111001010010001001111010111110100010001000111011111010011010111111111010111010000110110011111011001000100110101100010101011110010011100000101100101110101110010100100010011110101111101000100010001110111110100110101111111110101110100001101100111110110010001001101011000101010101000010 e4e0b2eb9489ebe888efa6bfeba1b3ec89ac55e4e0b2eb9489ebe888efa6bfeba1b3ec89ac5542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)