What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?純い?釐ダ◆??ュ?忍?純い?釐ダ◆??ュ?寅^ 001111111000111110000011100000101010001000111111111001111101100010000011010111111000000110011111001111110011111110000011100001010011111110010100010001010011111110001111100000111000001010100010001111111110011111011000100000110101111110000001100111110011111100111111100000111000010100111111100100111101000001011110 3f8f8382a23fe7d8835f819f3f3f83853f94453f8f8382a23fe7d8835f819f3f3f83853f93d05e
EUC-JP ?純い?釐ダ◆??ュ?忍?純い?釐ダ◆??ュ?寅^ 001111111011110111100011101001001010010000111111111011101101101010100101110000001010001010100001001111110011111110100101111001010011111111000111101001100011111110111101111000111010010010100100001111111110111011011010101001011100000010100010101000010011111100111111101001011110010100111111110001101101001001011110 3fbde3a4a43feedaa5c0a2a13f3fa5e53fc7a63fbde3a4a43feedaa5c0a2a13f3fa5e53fc6d25e
UTF-8 룶純い룵釐ダ◆룶쥚ュ룫忍룶純い룵釐ダ◆룶쥚ュ룫寅^ 11101011101000111011011011100111101101001001010011100011100000011000010011101011101000111011010111101001100001111001000011100011100000111000000011100010100101111000011011101011101000111011011011101100101001011001101011100011100000111010010111101011101000111010101111100101101111111000110111101011101000111011011011100111101101001001010011100011100000011000010011101011101000111011010111101001100001111001000011100011100000111000000011100010100101111000011011101011101000111011011011101100101001011001101011100011100000111010010111101011101000111010101111100101101011111000010101011110 eba3b6e7b494e38184eba3b5e98790e38380e29786eba3b6eca59ae383a5eba3abe5bf8deba3b6e7b494e38184eba3b5e98790e38380e29786eba3b6eca59ae383a5eba3abe5af855e
UHC 룶純い룵釐ダ◆룶쥚ュ룫忍룶純い룵釐ダ◆룶쥚ュ룫寅^ 10001111101010111110001011101101101010101010010010001111101010101101011111101101101010111100000010100001110111111000111110101011101000101000111110101011111001011000111110100010111011001101101110001111101010111110001011101101101010101010010010001111101010101101011111101101101010111100000010100001110111111000111110101011101000101000111110101011111001011000111110100010111011001101100101011110 8fabe2edaaa48faad7edabc0a1df8faba28fabe58fa2ecdb8fabe2edaaa48faad7edabc0a1df8faba28fabe58fa2ecd95e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
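
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes the table's charset labels correspond to Python's built-in codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` (for UHC), and that characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1 row suggests. The names `CODECS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into '?' (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("い^").items():
        print(label, bits, hex_string)
```

For the input `い^`, this sketch yields the hexadecimal strings `3f5e` (ISO-8859-1), `82a25e` (SJIS-WIN), `a4a45e` (EUC-JP), `e381845e` (UTF-8), and `aaa45e` (UHC). These agree with the byte sequences for `い` and `^` that appear in the table rows above.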