To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 艾??援ζ???????儒??怨k?筌??異 111001001000100000111111001111111000100110000111100000111100010000111111001111110011111100111111001111110011111100111111100011101111001000111111001111111000100110000101100000101000101100111111111000101010001100111111001111111000100011011001 e4883f3f898783c43f3f3f3f3f3f3f8ef23f3f8985828b3fe2a33f3f88d9
EUC-JP 艾??援ζ????孼??儒??怨k?筌??異 1110011111101000001111110011111110110001111001111010011011000110001111110011111100111111001111111000111110111010110000110011111100111111101111001111010000111111001111111011000111100101101000111110101100111111111001001010010100111111001111111011000011011011 e7e83f3fb1e7a6c63f3f3f3f8fbac33f3fbcf43f3fb1e5a3eb3fe4a53f3fb0db
UTF-8 艾싳궇援ζ젔琉뷩걶孼꾩쥙儒껆넭怨k룲筌뗪염異 1110100010001001101111101110110010001011101100111110101010110110100001111110011010001111101101001100111010110110111011001010000010010100111011111010011110001100111010111011011110101001111010101011000110110110111001011010110110111100111010101011111010101001111011001010010110011001111001011000010010010010111010101011101110000110111010111000010010101101111001101000000010101000111011111011110110001011111010111010001110110010111001111010110110001100111010111001011110101010111011001001011110111100111001111001010110110000 e889beec8bb3eab687e68fb4ceb6eca094efa78cebb7a9eab1b6e5adbceabea9eca599e58492eabb86eb84ade680a8efbd8beba3b2e7ad8ceb97aaec97bce795b0
UHC 艾싳궇援ζ젔琉뷩걶孼꾩쥙儒껆넭怨k룲筌뗪염異 1110010011110101100110101110110010000010101000001110101010110101101001011110011010100000100100101110101110100100101110101110001110000001100111001110010111101101100001001110110010100010100011101110101011100011100000111110011110000110101011001110101010110011101000111110101110001111101001111110111110100111100010111110101010111111101100001110110010110110 e4f59aec82a0eab5a5e6a092eba4bae3819ce5ed84eca28eeae383e786aceab3a3eb8fa7efa78beabfb0ecb6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)