What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鳶??違??魏??艶j?儒?????筌?? 10010011110011100011111100111111100010001110000100111111001111111110100110110000001111110011111110001001100100001000001010001010001111111000111011110010001111110011111100111111001111110011111111100010101000110011111100111111 93ce3f3f88e13f3fe9b03f3f8990828a3f8ef23f3f3f3f3fe2a33f3f
EUC-JP 鳶??違??魏??艶j?儒??洧??筌?? 110001101101000000111111001111111011000011100011001111110011111111110010101100100011111100111111101100011111000010100011111010100011111110111100111101000011111100111111100011111100011110110100001111110011111111100100101001010011111100111111 c6d03f3fb0e33f3ff2b23f3fb1f0a3ea3fbcf43f3f8fc7b43f3fe4a53f3f
UTF-8 鳶롫끏違욇씘魏됱쵇艶j쑴儒됵쫵洧뺤돟筌륂꼩 111010011011001110110110111010111010000110101011111010111000000110001111111010011000000110010101111011001001101010000111111011001001010010011000111010011010110110001111111010111001000010110001111011001011010110000111111010001000100110110110111011111011110110001010111011001001000110110100111001011000010010010010111010111001000010110101111011001010101110110101111001101011010010100111111010111011101010100100111010111000111110011111111001111010110110001100111010111010010110000010111010101011110010101001 e9b3b6eba1abeb818fe98195ec9a87ec9498e9ad8feb90b1ecb587e889b6efbd8aec91b4e58492eb90b5ecabb5e6b4a7ebbaa4eb8f9fe7ad8ceba582eabca9
UHC 鳶롫끏違욇씘魏됱쵇艶j쑴儒됵쫵洧뺤돟筌륂꼩 111001101110100110001110111010111000010110111111111010101101111010011110111010011001110110101101111010101110000010001001111011001010110010001001111001101111110110100011111010101011111010101001111010101110001110001001111011111010011010001100111010101111101110010101111011001000100110100101111011111010011110001111111011011000010010000110 e6e98eeb85bfeade9ee99dadeae089ecac89e6fda3eabea9eae389efa68ceafb95ec89a5efa78fed8486

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).