Which bit string does a character (or string) encode to in each character set?

Enter a single character or a short string and click "Convert".


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴦??惟??耶??踰??韋??沃????┐ 11101001111100010011111100111111100010001101001000111111001111111001011011101011001111110011111111100110111110100011111100111111111010001110100000111111001111111001011110000000001111110011111100111111001111111000010010100010 e9f13f3f88d23f3f96eb3f3fe6fa3f3fe8e83f3f97803f3f3f3f84a2
EUC-JP 鴦??惟??耶??踰??韋??沃????┐ 11110010111100110011111100111111101100001101010000111111001111111100110011101101001111110011111111101100111111000011111100111111111100001110101000111111001111111100110111100000001111110011111100111111001111111010100010100100 f2f33f3fb0d43f3fcced3f3fecfc3f3ff0ea3f3fcde03f3f3f3fa8a4
UTF-8 鴦꾨땶惟깅룴耶쇨쑬踰⑼쭕韋삳샷沃섅굥藺잞┐ 111010011011010010100110111010101011111010101000111010111001010110110110111001101000001110011111111010101011100110000101111010111010001110110100111010001000000010110110111011001000011110101000111011001001000110101100111010001011100010110000111000101001000110111100111011001010110110010101111010011001111110001011111011001000001010110011111011001000001110110111111001101011001010000011111011001000010010000101111010101011010110100101111011111010011110110000111011001001111010011110111000101001010010010000 e9b4a6eabea8eb95b6e6839feab985eba3b4e880b6ec87a8ec91ace8b8b0e291bcecad95e99f8bec82b3ec83b7e6b283ec8485eab5a5efa7b0ec9e9ee29490
UHC 鴦꾨땶惟깅룴耶쇨쑬踰⑼쭕韋삳샷沃섅굥藺잞┐ 111001001110110010000100111010111000101110001100111010101110111010110001111010111000111110101001111001011010110110111100111010101011111010101000111010111011001010101001111011111010011110001101111010101101111110111011111010111011110010100110111010001010101010011000111000111000001010001011111011001110000110011111111011111010011010100100 e4ec84eb8b8ceaeeb1eb8fa9e5adbceabea8ebb2a9efa78deadfbbebbca6e8aa98e3828bece19fefa6a4

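SJIS-Win, EUC-JP: legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively. Characters that a charset cannot represent appear in the table as "?" (byte 3f).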
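
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind the page. The function name `encode_to_bits` and the `CHARSETS` mapping are hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest equivalents of the charsets listed above. Unrepresentable characters are replaced with "?" to mirror the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become "?" (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "鴦"  # first character of the table's input
    for label, codec in CHARSETS.items():
        bits, hexstr = encode_to_bits(sample, codec)
        print(label, bits, hexstr)
```

For the UTF-8 row, this should give e9b4a6 for 鴦, which matches the first three bytes in the table. Other rows may differ slightly from the page wherever the Python codec's mapping tables differ from the ones the tool uses.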