Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯???鶯??陰??癒???????㏄飮?ザ 1110100111110010001111110011111100111111111010011111001000111111001111111000100101000001001111110011111110010110111111000011111100111111001111110011111100111111001111110011111110000111011101001001111101011010001111111000001101010101 e9f23f3f3fe9f23f3f89413f3f96fc3f3f3f3f3f3f3f87749f5a3f8355
EUC-JP 鶯???鶯??陰??癒????????飮?ザ 11110010111101000011111100111111001111111111001011110100001111110011111110110001101000100011111100111111110011001111111000111111001111110011111100111111001111110011111100111111001111111101110110111011001111111010010110110110 f2f43f3f3ff2f43f3fb1a23f3fccfe3f3f3f3f3f3f3f3fddbb3fa5b6
UTF-8 鶯ㅺ퉮횞鶯밸뱺陰덆걬癒곕쳯麗몃쓹留잞㏄飮뗭ザ 111010011011011010101111111000111000010110111010111011011000100110101110111011011001101010011110111010011011011010101111111010111011000010111000111010111011000110111010111010011001100110110000111010111000110110000110111010101011000110101100111001111001100110010010111010101011001110010101111011001011001110101111111011111010011010001000111010111010101010000011111011001001001110111001111011111010011110001101111011001001111010011110111000111000111110000100111010011010001110101110111010111001011110101101111000111000001010110110 e9b6afe385baed89aeed9a9ee9b6afebb0b8ebb1bae999b0eb8d86eab1ace79992eab395ecb3afefa688ebaa83ec93b9efa78dec9e9ee38f84e9a3aeeb97ade382b6
UHC 鶯ㅺ퉮횞鶯밸뱺陰덆걬癒곕쳯麗몃쓹留잞㏄飮뗭ザ 1110010110100011101001001110101010111001100001101100001110010111111001011010001110111001111010111001001110100000111010111110010010001000111010011000000110010101111010111010100010110000111010111010101110010011111001101011000010111000111010111001110110010101111010111010011110011111111011111010011110100110111010111110011010001011111011001010101110110110 e5a3a4eab986c397e5a3b9eb93a0ebe488e98195eba8b0ebab93e6b0b8eb9d95eba79fefa7a6ebe68becabb6

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
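
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears to be what the table shows for ISO-8859-1.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, binary, hexadecimal.
    for label, (binary, hexadecimal) in convert("ザ").items():
        print(label, binary, hexadecimal)
```

For the single character "ザ", this sketch yields `8355` in SJIS-WIN, `a5b6` in EUC-JP, and `e382b6` in UTF-8, which match the final bytes of the corresponding rows in the table above.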