What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 叱渉ォ漆上ー偲室叱渉ォ漆上ー偲偲E 111100101110100110001110101101101000111111000010101010111000111010111101100011111110001110110000100011101100001110001110101110101111001011101001100011101011011010001111110000101010101110001110101111011000111111100011101100001000111011000011100011101100001101000101 f2e98eb68fc2ab8ebd8fe3b08ec38ebaf2e98eb68fc2ab8ebd8fe3b08ec38ec345
EUC-JP ?叱渉ォ漆上ー偲室?叱渉ォ漆上ー偲偲E 0011111110111100101110001011111011000100100011101010101110111100101111111011111011100101100011101011000010111100110001011011110010111100001111111011110010111000101111101100010010001110101010111011110010111111101111101110010110001110101100001011110011000101101111001100010101000101 3fbcb8bec48eabbcbfbee58eb0bcc5bcbc3fbcb8bec48eabbcbfbee58eb0bcc5bcc545
UTF-8 叱渉ォ漆上ー偲室叱渉ォ漆上ー偲偲E 11101110100010001010000011100101100011111011000111100110101110001000100111101111101111011010101111100110101111001000011011100100101110001000101011101111101111011011000011100101100000011011001011100101101011101010010011101110100010001010000011100101100011111011000111100110101110001000100111101111101111011010101111100110101111001000011011100100101110001000101011101111101111011011000011100101100000011011001011100101100000011011001001000101 ee88a0e58fb1e6b889efbdabe6bc86e4b88aefbdb0e581b2e5aea4ee88a0e58fb1e6b889efbdabe6bc86e4b88aefbdb0e581b2e581b245
UHC ?叱??漆上??室?叱??漆上???E 0011111111110010111010100011111100111111111101101101010011011111101111100011111100111111111000111111100000111111111100101110101000111111001111111111011011010100110111111011111000111111001111110011111101000101 3ff2ea3f3ff6d4dfbe3f3fe3f83ff2ea3f3ff6d4dfbe3f3f3f45
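
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation: the function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the Python codec names (`latin-1`, `cp932`, `euc_jp`, `cp949`) are assumed to be the closest equivalents of the table's labels. Characters that a charset cannot represent are replaced with `?` (0x3f), which resembles the `3f` bytes in the rows above, though the exact substitutions may differ from the tool's.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "上E"  # one kanji taken from the table plus the trailing ASCII "E"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For the kanji 上 this yields `e4b88a` in UTF-8, `8fe3` in CP932, and `bee5` in EUC-JP, matching the byte pairs that appear for that character in the rows above.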

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).