Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????DD 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100010001000100 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4444
SJIS-WIN ???徇??節②?域よ?阿??沃??DD 001111110011111100111111100111000110110100111111001111111001000011011111100001110100000100111111100010001110011010000010111001100011111110001000101000100011111100111111100101111000000000111111001111110100010001000100 3f3f3f9c6d3f3f90df87413f88e682e63f88a23f3f97803f3f4444
EUC-JP 邕??徇??節??域よ?阿??沃??DD 10001111111000011110110100111111001111111101011111001110001111110011111111000000111000010011111100111111101100001110100010100100111010000011111110110000101001000011111100111111110011011110000000111111001111110100010001000100 8fe1ed3f3fd7ce3f3fc0e13f3fb0e8a4e83fb0a43f3fcde03f3f4444
UTF-8 邕뤶튊徇먬뼚節②퍘域よ퍘阿욁걧沃㏆쉔DD 1110100110000010100101011110101110100100101101101110110110001010100010101110010110111110100001111110101110101000101011001110101110111100100110101110011110101111100000001110001010010001101000011110110110001101100110001110010110011111100111111110001110000010100010001110110110001101100110001110100110011000101111111110110010011010100000011110101010110001101001111110011010110010100000111110001110001111100001101110110010001001100101000100010001000100 e98295eba4b6ed8a8ae5be87eba8acebbc9ae7af80e291a1ed8d98e59f9fe38288ed8d98e998bfec9a81eab1a7e6b283e38f86ec89944444
UHC 邕뤶튊徇먬뼚節②퍘域よ퍘阿욁걧沃㏆쉔DD 1110100010111011100011111110010010111001100111101110001011011111100100001110100110010110101000001110111110111101101010001110100010111011100011111110011010110100101010101110100010111011100011111110010010111001100111101110001110000001100100001110100010101010101001111110111110111101101010000100010001000100 e8bb8fe4b99ee2df90e996a0efbda8e8bb8fe6b4aae8bb8fe4b99ee38190e8aaa7efbda84444

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
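
A conversion like the one in the table can be approximated with a short script. This is only a minimal sketch, not the converter's actual implementation. The mapping from the charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# The real converter may use different conversion tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as code page 949
}


def encode_row(text, codec):
    """Encode text with one codec; return (binary string, hex string)."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for every codec."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あD").items():
        print(label, bits, hex_string)
```

Each byte is formatted as eight binary digits, so the binary column is always exactly four times as long as the hexadecimal column.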