To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????h???????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011010000011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f
SJIS-WIN 偲セト宍偲叱偲璽偲クナ磁h偲セト宍偲叱偲璽 1000111011000011101111101100010010001110101100111000111011000011100011101011011010001110110000111000111010100011100011101100001110111000110001011000111010100101011010001000111011000011101111101100010010001110101100111000111011000011100011101011011010001110110000111000111010100011 8ec3bec48eb38ec38eb68ec38ea38ec3b8c58ea5688ec3bec48eb38ec38eb68ec38ea3
EUC-JP 偲セト宍偲叱偲璽偲クナ磁h偲セト宍偲叱偲璽 1011110011000101100011101011111010001110110001001011110010110101101111001100010110111100101110001011110011000101101111001010010110111100110001011000111010111000100011101100010110111100101001110110100010111100110001011000111010111110100011101100010010111100101101011011110011000101101111001011100010111100110001011011110010100101 bcc58ebe8ec4bcb5bcc5bcb8bcc5bca5bcc58eb88ec5bca768bcc58ebe8ec4bcb5bcc5bcb8bcc5bca5
UTF-8 偲セト宍偲叱偲璽偲クナ磁h偲セト宍偲叱偲璽 11100101100000011011001011101111101111011011111011101111101111101000010011100101101011101000110111100101100000011011001011100101100011111011000111100101100000011011001011100111100100101011110111100101100000011011001011101111101111011011100011101111101111101000010111100111101000111000000101101000111001011000000110110010111011111011110110111110111011111011111010000100111001011010111010001101111001011000000110110010111001011000111110110001111001011000000110110010111001111001001010111101 e581b2efbdbeefbe84e5ae8de581b2e58fb1e581b2e792bde581b2efbdb8efbe85e7a38168e581b2efbdbeefbe84e5ae8de581b2e58fb1e581b2e792bd
UHC ?????叱?璽???磁h?????叱?璽 0011111100111111001111110011111100111111111100101110101000111111110111111101111000111111001111110011111111101101101110000110100000111111001111110011111100111111001111111111001011101010001111111101111111011110 3f3f3f3f3ff2ea3fdfde3f3f3fedb8683f3f3f3f3ff2ea3fdfde

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)