What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?忌????????げ?忌????????げ^ 001111111000101011110101001111110011111100111111001111110011111100111111001111110011111110000010101100000011111110001010111101010011111100111111001111110011111100111111001111110011111100111111100000101011000001011110 3f8af53f3f3f3f3f3f3f3f82b03f8af53f3f3f3f3f3f3f3f82b05e
EUC-JP ?忌????????げ?忌????????げ^ 001111111011010011110111001111110011111100111111001111110011111100111111001111110011111110100100101100100011111110110100111101110011111100111111001111110011111100111111001111110011111100111111101001001011001001011110 3fb4f73f3f3f3f3f3f3f3fa4b23fb4f73f3f3f3f3f3f3f3fa4b25e
UTF-8 룶忌룶濫풔룶濫퓽룶핊げ룶忌룶濫풔룶濫퓽룶핊げ^ 11101011101000111011011011100101101111111000110011101011101000111011011011101111101001001010001011101101100100101001010011101011101000111011011011101111101001001010001011101101100100111011110111101011101000111011011011101101100101011000101011100011100000011001001011101011101000111011011011100101101111111000110011101011101000111011011011101111101001001010001011101101100100101001010011101011101000111011011011101111101001001010001011101101100100111011110111101011101000111011011011101101100101011000101011100011100000011001001001011110 eba3b6e5bf8ceba3b6efa4a2ed9294eba3b6efa4a2ed93bdeba3b6ed958ae38192eba3b6e5bf8ceba3b6efa4a2ed9294eba3b6efa4a2ed93bdeba3b6ed958ae381925e
UHC 룶忌룶濫풔룶濫퓽룶핊げ룶忌룶濫풔룶濫퓽룶핊げ^ 100011111010101111010000111110111000111110101011110100011111101011000111101101001000111110101011110100011111101011000111110000001000111110101011110000001000111110101010101100101000111110101011110100001111101110001111101010111101000111111010110001111011010010001111101010111101000111111010110001111100000010001111101010111100000010001111101010101011001001011110 8fabd0fb8fabd1fac7b48fabd1fac7c08fabc08faab28fabd0fb8fabd1fac7b48fabd1fac7c08fabc08faab25e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
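
The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch, not the page's actual implementation: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as is the use of `errors="replace"` to reproduce the `3f` ("?") bytes that appear where a character cannot be represented. The names `CODECS`, `to_bit_strings`, and `convert` are hypothetical helpers introduced for illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # Characters the codec cannot represent become "?" (byte 0x3f).
    data = text.encode(codec, errors="replace")
    # Each byte is written as exactly eight binary digits.
    binary = "".join(format(byte, "08b") for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("げ^").items():
        print(label, binary, hexadecimal)
```

For the input `げ^`, this sketch yields `82b05e` for SJIS-WIN, `a4b25e` for EUC-JP, and `e381925e` for UTF-8, matching the trailing bytes of the corresponding rows in the table. ISO-8859-1 cannot represent `げ`, so it yields `3f5e`.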