To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鵝??猷?ぜ誘l?閻??誼??儒??沃 11101010010000000011111100111111100101110101000100111111100000101011101010010111010101011000001010001100001111111110100010000101001111110011111110001011011000100011111100111111100011101111001000111111001111111001011110000000 ea403f3f97513f82ba9755828c3fe8853f3f8b623f3f8ef23f3f9780
EUC-JP 鵝??猷?ぜ誘l?閻??誼??儒??沃 11110011101000010011111100111111110011011011001000111111101001001011110011001101101101101010001111101100001111111110111111100101001111110011111110110101110000110011111100111111101111001111010000111111001111111100110111100000 f3a13f3fcdb23fa4bccdb6a3ec3fefe53f3fb5c33f3fbcf43f3fcde0
UTF-8 鵝숈뮇猷녻ぜ誘l뒙閻롫챷誼붼뒽儒우벞沃 111010011011010110011101111011001000100010001000111010111010111010000111111001111000110010110111111010111000010110111011111000111000000110011100111010001010101010011000111011111011110110001100111010111001001010011001111010011001011010111011111010111010000110101011111011001011000110110111111010001010101010111100111010111011011010111100111010111001001010111101111001011000010010010010111011001001101010110000111010111011001010011110111001101011001010000011 e9b59dec8888ebae87e78cb7eb85bbe3819ce8aa98efbd8ceb9299e996bbeba1abecb1b7e8aabcebb6bceb92bde58492ec9ab0ebb29ee6b283
UHC 鵝숈뮇猷녻ぜ誘l뒙閻롫챷誼붼뒽儒우벞沃 1110010010111101100110011110110010010010100101101110101110100011100001101110100010101010101111001110101110101111101000111110110010001010100101101110011110100010100011101110101110101010100001001110101111111110100101001110100110001010101100111110101011100011101111111110110010010011101110011110100010101010 e4bd99ec9296eba386e8aabcebafa3ec8a96e7a28eebaa84ebfe94e98ab3eae3bfec93b9e8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)