To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8意??袁l????????維??兪?? 1110000110011111001111111000001001010111100010001101001100111111001111111110010111001101100000101000110000111111001111110011111100111111001111110011111100111111001111111000100011011011001111110011111110011001011000000011111100111111 e19f3f825788d33f3fe5cd828c3f3f3f3f3f3f3f3f88db3f3f99603f3f
EUC-JP 癲?8意??袁l?孼??????維??兪?? 11100010101000010011111110100011101110001011000011010101001111110011111111101010110011111010001111101100001111111000111110111010110000110011111100111111001111110011111100111111001111111011000011011101001111110011111111010001110000010011111100111111 e2a13fa3b8b0d53f3feacfa3ec3f8fbac33f3f3f3f3f3fb0dd3f3fd1c13f3f
UTF-8 癲쒕8意덅굢袁l뵯孼꾊랁룍劣겸뮦維볢땻兪귙럹 111001111001100110110010111011001001001010010101111011111011110010011000111001101000010010001111111010111000110110000101111010101011010110100010111010001010001010000001111011111011110110001100111010111011010110101111111001011010110110111100111010101011111010001010111010111001111010000001111010111010001110001101111011111010011010011101111010101011001010111000111010111010111010100110111001111011011010101101111010111011001110100010111010111001010110111011111001011000010110101010111010101011011110011001111010111001111110111001 e799b2ec9295efbc98e6848feb8d85eab5a2e8a281efbd8cebb5afe5adbceabe8aeb9e81eba38defa69deab2b8ebaea6e7b6adebb3a2eb95bbe585aaeab799eb9fb9
UHC 癲쒕8意덅굢袁l뵯孼꾊랁룍劣겸뮦維볢땻兪귙럹 1110111110100110100111001110101110100011101110001110101111110010100010001110100010000010100010011110101010111110101000111110110010010100101011011110010111101101100001001101000110001101111011011000111110001011111001101110101110110000111000101001001010110001111010111010101110010011111010001000101110010001111010101110010010000010111000111000111010011000 efa69ceba3b8ebf288e88289eabea3ec94ade5ed84d18ded8f8be6ebb0e292b1ebab93e88b91eae482e38e98

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)