To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴦??????異?ⅰ純??畏??泣?? 11101001111100010011111100111111001111110011111100111111001111111000100011011001001111111111101001000000100011111000001100111111001111111000100011011000001111110011111110001011100000110011111100111111 e9f13f3f3f3f3f3f88d93ffa408f833f3f88d83f3f8b833f3f
EUC-JP 鴦???孼??異??純??畏??泣?? 1111001011110011001111110011111100111111100011111011101011000011001111110011111110110000110110110011111100111111101111011110001100111111001111111011000011011010001111110011111110110101111000110011111100111111 f2f33f3f3f8fbac33f3fb0db3f3fbde33f3fb0da3f3fb5e33f3f
UTF-8 鴦꾆쇰뼕孼뽯쓬異녔ⅰ純껋뮏畏브퀣泣숋쭛 111010011011010010100110111010101011111010000110111011001000011110110000111010111011110010010101111001011010110110111100111010111011110110101111111011001001001110101100111001111001010110110000111010111000010110010100111000101000010110110000111001111011010010010100111010101011101110001011111010111010111010001111111001111001010110001111111010111011100010001100111011011000000010100011111001101011001110100011111011001000100010001011111011001010110110011011 e9b4a6eabe86ec87b0ebbc95e5adbcebbdafec93ace795b0eb8594e285b0e7b494eabb8bebae8fe7958febb88ced80a3e6b3a3ec888becad9b
UHC 鴦꾆쇰뼕孼뽯쓬異녔ⅰ純껋뮏畏브퀣泣숋쭛 1110010011101100100001001100111010111100111010111001011010011101111001011110110110010110111010111001110110001100111011001011011010110011111001101010010110100001111000101110110110000011111011001001001010011100111010001110011010111010111010101011001110010111111010111110100010011001111011111010011110010001 e4ec84cebceb969de5ed96eb9d8cecb6b3e6a5a1e2ed83ec929ce8e6baeab397ebe899efa791

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)