To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 淨?弔???伊豆?矜?伊逗?籠∧??衣?? 10011111110001000011111110010010101000100011111100111111001111111000100011001001100100111010010000111111111000011110000000111111100010001100100110010000100000000011111111100010110001001000000111001000001111110011111110001000110111110011111100111111 9fc43f92a23f3f3f88c993a43fe1e03f88c990803fe2c481c83f3f88df3f3f
EUC-JP 淨?弔???伊豆?矜?伊逗?籠∧??衣?? 11011110110001100011111111000100101001000011111100111111001111111011000011001011110001101010011000111111111000101110001000111111101100001100101110111111111000000011111111100100110001101010001011001010001111110011111110110000111000010011111100111111 dec63fc4a43f3f3fb0cbc6a63fe2e23fb0cbbfe03fe4c6a2ca3f3fb0e13f3f
UTF-8 淨렠弔렟罹렗伊豆렚矜썬伊逗렫籠∧亐렕衣쯔렢 111001101011011110101000111010111010000010100000111001011011110010010100111010111010000010011111111011111010011110100110111010111010000010010111111001001011110010001010111010001011000110000110111010111010000010011010111001111001111110011100111011001000110110101100111001001011110010001010111010011000000010010111111010111010000010101011111001111011000110100000111000101000100010100111111001001011101010010000111010111010000010010101111010001010000110100011111011001010111110010100111010111010000010100010 e6b7a8eba0a0e5bc94eba09fefa7a6eba097e4bc8ae8b186eba09ae79f9cec8dace4bc8ae98097eba0abe7b1a0e288a7e4ba90eba095e8a1a3ecaf94eba0a2
UHC 淨렠弔렟罹렗伊豆렚矜썬伊逗렫籠∧亐렕衣쯔렢 111011111110010010001110101100011111000011000000100011101011000011101100101110101000111010101100111011001010010111010100111001111000111010101101110100001110100010111101111000111110110010100101110101001110100010001110101110011101011011101011101000011111110011101010101001111000111010101010111010111111110111000010111010101000111010110011 efe48eb1f0c08eb0ecba8eaceca5d4e78eadd0e8bde3eca5d4e88eb9d6eba1fceaa78eaaebfdc2ea8eb3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)