To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴦??惟??亦?????癒??嚴ъ?二?? 11101001111100010011111100111111100010001101001000111111001111111001011010010010001111110011111100111111001111110011111110010110111111000011111100111111100110101000111010000100100011000011111110010011111100010011111100111111 e9f13f3f88d23f3f96923f3f3f3f3f96fc3f3f9a8e848c3f93f13f3f
EUC-JP 鴦??惟??亦??庾??癒??嚴ъ?二?? 111100101111001100111111001111111011000011010100001111110011111111001011111100100011111100111111100011111011110011001110001111110011111111001100111111100011111100111111110100111110111010100111111011000011111111000110111100110011111100111111 f2f33f3fb0d43f3fcbf23f3f8fbcce3f3fccfe3f3fd3eea7ec3fc6f33f3f
UTF-8 鴦꾨땶惟깊깵亦껋눖庾얏쨫癒⑸눤嚴ъ쥙二뜹럡 1110100110110100101001101110101010111110101010001110101110010101101101101110011010000011100111111110101010111001100010101110101010111001101101011110010010111010101001101110101010111011100010111110101110001000100101101110010110111010101111101110110010010110100011111110110010101000101010111110011110011001100100101110001010010001101110001110101110001000101001001110010110011010101101001101000110001010111011001010010110011001111001001011101010001100111010111001110010111001111010111001111110100001 e9b4a6eabea8eb95b6e6839feab98aeab9b5e4baa6eabb8beb8896e5babeec968feca8abe79992e291b8eb88a4e59ab4d18aeca599e4ba8ceb9cb9eb9fa1
UHC 鴦꾨땶惟깊깵亦껋눖庾얏쨫癒⑸눤嚴ъ쥙二뜹럡 111001001110110010000100111010111000101110001100111010101110111010110001111011011000001110100011111001101011001010000011111011001000011110110000111010101110110010111110111001101010010010000101111010111010100010101001111010111000011110111011111001011111000110101100111011001010001010001110111011001010001110110110111001011000111010000100 e4ec84eb8b8ceaeeb1ed83a3e6b283ec87b0eaecbee6a485eba8a9eb87bbe5f1aceca28eeca3b6e58e84

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)