Which bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????¿???????????? 001111110011111100111111001111110011111100111111001111110011111110111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3fbf3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 曄??松?㏄????????。裔??怨?? 100111100100000000111111001111111000111110111100001111111000011101110100001111110011111100111111001111110011111100111111001111110011111110000001010000101110010111100001001111110011111110001001100001010011111100111111 9e403f3f8fbc3f87743f3f3f3f3f3f3f3f8142e5e13f3f89853f3f
EUC-JP 曄??松????¿?????。裔??怨?? 11011011101000010011111100111111101111101011111000111111001111110011111100111111100011111010001011000100001111110011111100111111001111110011111110100001101000111110101011100011001111110011111110110001111001010011111100111111 dba13f3fbebe3f3f3f3f8fa2c43f3f3f3f3fa1a3eae33f3fb1e53f3f
UTF-8 曄됯퀡松㏆㏄溜쀦¿溜잏뼇溜묋。裔녿젨怨꾨젷 1110011010011011100001001110101110010000101011111110110110000000101000011110011010011101101111101110001110001111100001101110001110001111100001001110111110100111100010111110110010000000101001101100001010111111111011111010011110001011111011001001111010001111111010111011110010000111111011111010011110001011111010111010110010001011111000111000000010000010111010001010001110010100111010111000010110111111111011001010000010101000111001101000000010101000111010101011111010101000111011001010000010110111 e69b84eb90afed80a1e69dbee38f86e38f84efa78bec80a6c2bfefa78bec9e8febbc87efa78bebac8be38082e8a394eb85bfeca0a8e680a8eabea8eca0b7
UHC 曄됯퀡松㏆㏄溜쀦¿溜잏뼇溜묋。裔녿젨怨꾨젷 111001111010010110001001111010101011001110010101111000011110011010100111111011111010011110100110111010101111111010010111111001101010001010101111111010101111111010011111111001111001011010010001111010101111111010010001111010001010000110100011111001111110000010000110111010111010000010100000111010101011001110000100111010111010000010101011 e7a589eab395e1e6a7efa7a6eafe97e6a2afeafe9fe79691eafe91e8a1a3e7e086eba0a0eab384eba0ab

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
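
The same kind of table can be approximated offline. The following is a minimal Python sketch, not the converter used by this page: the codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are Python's, and mapping them to the labels above is an assumption, as are the helper name `bit_strings` and the sample text. With `errors="replace"`, a character that a charset cannot represent is encoded as `?` (0x3f), which corresponds to the runs of `3f` in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # Characters the codec cannot represent are replaced by b"?" (0x3f).
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "松。"  # hypothetical input; any short string works
    for label, codec in CHARSETS.items():
        binary, hexadecimal = bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is padded to eight bits so that the binary column is always exactly four times as long as the hexadecimal column, as in the table above.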