Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 癲쑳삘닪阿녠만韋 111001111001100110110010111011001001000110110011111011001000001010011000111010111000101110101010111010011001100010111111111010111000010110100000111010111010011110001100111010011001111110001011 e799b2ec91b3ec8298eb8baae998bfeb85a0eba78ce99f8b
SJIS-WIN ???????????????????§???? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111100000011001100000111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f81983f3f3f3f
EUC-JP ç??ì??ì??ë?ªé?¿ë??ë§?é?? 100011111010101110101110001111110011111110001111101010111100000000111111001111111000111110101011110000000011111100111111100011111010101110110011001111111000111110100010111011001000111110101011101100010011111110001111101000101100010010001111101010111011001100111111001111111000111110101011101100111010000111111000001111111000111110101011101100010011111100111111 8fabae3f3f8fabc03f3f8fabc03f3f8fabb33f8fa2ec8fabb13f8fa2c48fabb33f3f8fabb3a1f83f8fabb13f3f
UTF-8 癲쑳삘닪阿녠만韋 110000111010011111000010100110011100001010110010110000111010110011000010100100011100001010110011110000111010110011000010100000101100001010011000110000111010101111000010100010111100001010101010110000111010100111000010100110001100001010111111110000111010101111000010100001011100001010100000110000111010101111000010101001111100001010001100110000111010100111000010100111111100001010001011 c3a7c299c2b2c3acc291c2b3c3acc282c298c3abc28bc2aac3a9c298c2bfc3abc285c2a0c3abc2a7c28cc3a9c29fc28b
UHC ??²??³?????ª??¿????§???? 0011111100111111101010011111011100111111001111111010100111111000001111110011111100111111001111110011111110101000101000110011111100111111101000101010111100111111001111110011111100111111101000011101011100111111001111110011111100111111 3f3fa9f73f3fa9f83f3f3f3f3fa8a33f3fa2af3f3f3f3fa1d73f3f3f3f
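The rows above can be approximated with a short Python sketch. It is not the tool's actual implementation: the function names `bit_strings` and `encode_row`, the mapping of the table's charset labels to Python codec names, and the choice to replace unencodable characters with "?" (0x3f) are all assumptions made for illustration.

```python
from __future__ import annotations


def bit_strings(data: bytes) -> tuple[str, str]:
    """Return the (binary, hexadecimal) digit strings for a byte sequence."""
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return binary, data.hex()


def encode_row(text: str, charset: str) -> tuple[str, str]:
    """Encode text in the given charset and return its bit strings.

    Assumption: unencodable characters become "?" (0x3f), which matches
    the 3f bytes visible in the table but is not confirmed by the page.
    """
    return bit_strings(text.encode(charset, errors="replace"))


# Python codec names assumed to correspond to the charsets in the table.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_row("§", codec)
        print(label, binary, hexadecimal)
```

In the table, the hexadecimal string of the ISO-8859-1 row (e799b2...) equals the UTF-8 bytes of the displayed characters, and the UTF-8 row (c3a7c299c2b2...) is what results when each of those bytes is treated as an ISO-8859-1 character and encoded again. This suggests, without confirming, that the tool reads its input as ISO-8859-1.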

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
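As a small illustration of how the two Japanese charsets differ, the following sketch encodes the same text with both. The sample string and the use of Python's `cp932` and `euc_jp` codecs as stand-ins for SJIS-Win and EUC-JP are assumptions, not something the page specifies.

```python
text = "日本語"  # sample input chosen for illustration

sjis_win = text.encode("cp932")   # CP932, the Windows variant of Shift JIS
euc_jp = text.encode("euc_jp")    # EUC-JP, traditionally common on UNIX

# The same three characters yield different byte sequences in each charset.
print(sjis_win.hex())  # 93fa967b8cea
print(euc_jp.hex())    # c6fccbdcb8ec
```

Both encodings use two bytes per character here, but the byte values differ, so text must be decoded with the same charset it was encoded in.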