To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鵝??意h?矣?グ???悠??儒??蘂?? 1110101001000000001111110011111110001000110100111000001010001000001111111110000111100001001111111000001101001111001111110011111100111111100101110100100100111111001111111000111011110010001111110011111111100101010000010011111100111111 ea403f3f88d382883fe1e13f834f3f3f3f97493f3f8ef23f3fe5413f3f
EUC-JP 鵝??意h?矣?グ???悠??儒??蘂?? 1111001110100001001111110011111110110000110101011010001111101000001111111110001011100011001111111010010110110000001111110011111100111111110011011010101000111111001111111011110011110100001111110011111111101001101000100011111100111111 f3a13f3fb0d5a3e83fe2e33fa5b03f3f3fcdaa3f3fbcf43f3fe9a23f3f
UTF-8 鵝숈뮆意h뵯矣몄グ銳얜슣悠당땸儒얠맼蘂뚯칳 111010011011010110011101111011001000100010001000111010111010111010000110111001101000010010001111111011111011110110001000111010111011010110101111111001111001111110100011111010111010101010000100111000111000001010110000111010011000101010110011111011001001011010011100111011001000101010100011111001101000001010100000111010111000101110111001111010111001010110111000111001011000010010010010111011001001011010100000111010111010011110111100111010001001100010000010111010111001101010101111111011001011100110110011 e9b59dec8888ebae86e6848fefbd88ebb5afe79fa3ebaa84e382b0e98ab3ec969cec8aa3e682a0eb8bb9eb95b8e58492ec96a0eba7bce89882eb9aafecb9b3
UHC 鵝숈뮆意h뵯矣몄グ銳얜슣悠당땸儒얠맼蘂뚯칳 111001001011110110011001111011001001001010010101111010111111001010100011111010001001010010101101111010111111100010111000111011001010101110110000111001111110010110111110111010111001101010101111111010101110110110110100111001111000101110001110111010101110001110111110111011001001000010111101111001111101111010001100111011001010111110000110 e4bd99ec9295ebf2a3e894adebf8b8ecabb0e7e5beeb9aafeaedb4e78b8eeae3beec90bde7de8cecaf86

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
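
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's standard codecs `cp932`, `euc_jp`, and `cp949` correspond to the SJIS-WIN, EUC-JP, and UHC rows, and that characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS` and `to_bit_strings` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes '?' (0x3f) for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    # Render each byte as 8 binary digits, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "意"  # one of the characters that appears in the table above
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

The sketch encodes a correctly decoded input string. The table above appears to come from input whose bytes were already misinterpreted, so running the sketch on the displayed characters will not necessarily reproduce every row.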