To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 霓??蹂?霓??蹂?B 111010001011110100111111001111111110011011111000001111111110100010111101001111110011111111100110111110000011111101000010 e8bd3f3fe6f83fe8bd3f3fe6f83f42
EUC-JP 霓??蹂?霓??蹂?B 111100001011111100111111001111111110110011111010001111111111000010111111001111110011111111101100111110100011111101000010 f0bf3f3fecfa3ff0bf3f3fecfa3f42
UTF-8 霓ㅻ뀘蹂븎霓ㅻ뀘蹂븎B 11101001100111001001001111100011100001011011101111101011100000001001100011101000101110011000001011101011101110001000111011101001100111001001001111100011100001011011101111101011100000001001100011101000101110011000001011101011101110001000111001000010 e99c93e385bbeb8098e8b982ebb88ee99c93e385bbeb8098e8b982ebb88e42
UHC 霓ㅻ뀘蹂븎霓ㅻ뀘蹂븎B 111001111110011110100100111010111000010110010001111010111011001110010101011101101110011111100111101001001110101110000101100100011110101110110011100101010111011001000010 e7e7a4eb8591ebb39576e7e7a4eb8591ebb3957642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)