To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 秧??豫??涯??[秧??豫??涯??[^ 111000100101111000111111001111111001100010101100001111110011111110001010010101010011111100111111010110111110001001011110001111110011111110011000101011000011111100111111100010100101010100111111001111110101101101011110 e25e3f3f98ac3f3f8a553f3f5be25e3f3f98ac3f3f8a553f3f5b5e
EUC-JP 秧??豫??涯??[秧??豫??涯??[^ 111000111011111100111111001111111101000010101110001111110011111110110011101101100011111100111111010110111110001110111111001111110011111111010000101011100011111100111111101100111011011000111111001111110101101101011110 e3bf3f3fd0ae3f3fb3b63f3f5be3bf3f3fd0ae3f3fb3b63f3f5b5e
UTF-8 秧ⓨ븵豫뷴뜥涯욆끆[秧ⓨ븵豫뷴뜥涯욆끆[^ 111001111010011110100111111000101001001110101000111010111011100010110101111010001011000110101011111010111011011110110100111010111001110010100101111001101011011010101111111011001001101010000110111010111000000110000110010110111110011110100111101001111110001010010011101010001110101110111000101101011110100010110001101010111110101110110111101101001110101110011100101001011110011010110110101011111110110010011010100001101110101110000001100001100101101101011110 e7a7a7e293a8ebb8b5e8b1abebb7b4eb9ca5e6b6afec9a86eb81865be7a7a7e293a8ebb8b5e8b1abebb7b4eb9ca5e6b6afec9a86eb81865b5e
UHC 秧ⓨ븵豫뷴뜥涯욆끆[秧ⓨ븵豫뷴뜥涯욆끆[^ 111001001110101110101000111001011001010110011110111001111110001110111010111001011000110110101000111001001111001110011110111010001000010110111010010110111110010011101011101010001110010110010101100111101110011111100011101110101110010110001101101010001110010011110011100111101110100010000101101110100101101101011110 e4eba8e5959ee7e3bae58da8e4f39ee885ba5be4eba8e5959ee7e3bae58da8e4f39ee885ba5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)