To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????i?????????iB 001111110011111100111111001111110011111100111111001111110011111100111111011010010011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f6942
SJIS-WIN 臾?ぜ維?Н維?しi臾?ぜ維?Н維?しiB 111001000110101100111111100000101011101010001000110110110011111110000100010011101000100011011011001111111000001010110101011010011110010001101011001111111000001010111010100010001101101100111111100001000100111010001000110110110011111110000010101101010110100101000010 e46b3f82ba88db3f844e88db3f82b569e46b3f82ba88db3f844e88db3f82b56942
EUC-JP 臾?ぜ維?Н維?しi臾?ぜ維?Н維?しiB 111001111100110000111111101001001011110010110000110111010011111110100111101011111011000011011101001111111010010010110111011010011110011111001100001111111010010010111100101100001101110100111111101001111010111110110000110111010011111110100100101101110110100101000010 e7cc3fa4bcb0dd3fa7afb0dd3fa4b769e7cc3fa4bcb0dd3fa7afb0dd3fa4b76942
UTF-8 臾노ぜ維믩Н維뚮しi臾노ぜ維믩Н維뚮しiB 11101000100001111011111011101011100001011011100011100011100000011001110011100111101101101010110111101011101011111010100111010000100111011110011110110110101011011110101110011010101011101110001110000001100101110110100111101000100001111011111011101011100001011011100011100011100000011001110011100111101101101010110111101011101011111010100111010000100111011110011110110110101011011110101110011010101011101110001110000001100101110110100101000010 e887beeb85b8e3819ce7b6adebafa9d09de7b6adeb9aaee3819769e887beeb85b8e3819ce7b6adebafa9d09de7b6adeb9aaee381976942
UHC 臾노ぜ維믩Н維뚮しi臾노ぜ維믩Н維뚮しiB 111010111010110010110011111010111010101010111100111010111010101110010010111010111010110010101111111010111010101110001100111010111010101010110111011010011110101110101100101100111110101110101010101111001110101110101011100100101110101110101100101011111110101110101011100011001110101110101010101101110110100101000010 ebacb3ebaabcebab92ebacafebab8cebaab769ebacb3ebaabcebab92ebacafebab8cebaab76942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)