To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 余??誼ら?濡??筌??誼??儀??業 100101110101110100111111001111111000101101100010100000101110011100111111100101000100011100111111001111111110001010100011001111110011111110001011011000100011111100111111100010110101011000111111001111111000101111000110 975d3f3f8b6282e73f94473f3fe2a33f3f8b623f3f8b563f3f8bc6
EUC-JP 余??誼ら?濡??筌??誼??儀??業 110011011011111000111111001111111011010111000011101001001110100100111111110001111010100000111111001111111110010010100101001111110011111110110101110000110011111100111111101101011011011100111111001111111011011011001000 cdbe3f3fb5c3a4e93fc7a83f3fe4a53f3fb5c33f3fb5b73f3fb6c8
UTF-8 余쒖뮆誼ら턁濡⑸깽筌뗫베誼숁썫儀뺤젶業 111001001011110110011001111011001001001010010110111010111010111010000110111010001010101010111100111000111000001010001001111011011000010010000001111001101011111110100001111000101001000110111000111010101011100110111101111001111010110110001100111010111001011110101011111010111011001010100000111010001010101010111100111011001000100010000001111011001000110110101011111001011000010010000000111010111011101010100100111011001010000010110110111001101010010110101101 e4bd99ec9296ebae86e8aabce38289ed8481e6bfa1e291b8eab9bde7ad8ceb97abebb2a0e8aabcec8881ec8dabe58480ebbaa4eca0b6e6a5ad
UHC 余쒖뮆誼ら턁濡⑸깽筌뗫베誼숁썫儀뺤젶業 1110010111111001100111001110110010010010100101011110101111111110101010101110100110110101100111011110101110100001101010011110101110110010101001001110111110100111100010111110101110111010101000111110101111111110100110011110011010011011100111001110101111110000100101011110110010100000101010101110010111110110 e5f99cec9295ebfeaae9b59deba1a9ebb2a4efa78bebbaa3ebfe99e69b9cebf095eca0aae5f6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)