To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘂??揖??楡??檍?????矣??暗?? 111001010100000100111111001111111001011101001011001111110011111110011110101111100011111100111111100111101111100000111111001111110011111100111111001111111110000111100001001111110011111110001000110000110011111100111111 e5413f3f974b3f3f9ebe3f3f9ef83f3f3f3f3fe1e13f3f88c33f3f
EUC-JP 蘂??揖??楡??檍?????矣??暗?? 111010011010001000111111001111111100110110101100001111110011111111011100110000000011111100111111110111001111101000111111001111110011111100111111001111111110001011100011001111110011111110110000110001010011111100111111 e9a23f3fcdac3f3fdcc03f3fdcfa3f3f3f3f3fe2e33f3fb0c53f3f
UTF-8 蘂띠눖揖겻슭楡⒲걶檍됰챶吏롳쬅矣⑸룇暗싲옟 111010001001100010000010111010111001110110100000111010111000100010010110111001101000111110010110111010101011001010111011111011001000101010101101111001101010010110100001111000101001001010110010111010101011000110110110111001101010101010001101111010111001000010110000111011001011000110110110111011111010011110011110111010111010000110110011111011001010110010000101111001111001111110100011111000101001000110111000111010111010001110000111111001101001101010010111111011001000101110110010111011001001100010011111 e89882eb9da0eb8896e68f96eab2bbec8aade6a5a1e292b2eab1b6e6aa8deb90b0ecb1b6efa79eeba1b3ecac85e79fa3e291b8eba387e69a97ec8bb2ec989f
UHC 蘂띠눖揖겻슭楡⒲걶檍됰챶吏롳쬅矣⑸룇暗싲옟 111001111101111010110110111011001000011110110000111010111110011110110000111001001011110110111110111010101111100010101001111000111000000110011100111001011110010110001001111010111010101010000011111011001010011110001110111011111010011010011100111010111111100010101001111010111000111110000110111001001101111010011010111010111001111010100001 e7deb6ec87b0ebe7b0e4bdbeeaf8a9e3819ce5e589ebaa83eca78eefa69cebf8a9eb8f86e4de9aeb9ea1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)