To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 狎??游①????勇??音?ぃ鷹??狎 111000001011111000111111001111111001111111100000100001110100000000111111001111110011111100111111100101110100010100111111001111111000100110111001001111111000001010100001100100011110100100111111001111111110000010111110 e0be3f3f9fe087403f3f3f3f97453f3f89b93f82a191e93f3fe0be
EUC-JP 狎??游??洧??勇??音?ぃ鷹??狎 11100000110000000011111100111111110111101110001000111111001111111000111111000111101101000011111100111111110011011010011000111111001111111011001010111011001111111010010010100011110000101110101100111111001111111110000011000000 e0c03f3fdee23f3f8fc7b43f3fcda63f3fb2bb3fa4a3c2eb3f3fe0c0
UTF-8 狎녿씒游①뛾洧곗뎾勇싰만音귝ぃ鷹숈돱狎 111001111000101110001110111010111000010110111111111011001001010010010010111001101011100010111000111000101001000110100000111010111001101110111110111001101011010010100111111010101011001110010111111010111000111010111110111001011000101110000111111011001000101110110000111010111010011110001100111010011001111110110011111010101011011110011101111000111000000110000011111010011011011110111001111011001000100010001000111010111000111110110001111001111000101110001110 e78b8eeb85bfec9492e6b8b8e291a0eb9bbee6b4a7eab397eb8ebee58b87ec8bb0eba78ce99fb3eab79de38183e9b7b9ec8888eb8fb1e78b8e
UHC 狎녿씒游①뛾洧곗뎾勇싰만音귝ぃ鷹숈돱狎 1110010011100100100001101110101110011101101010001110101011111101101010001110011110001101100001001110101011111011101100001110110010001001100100011110100110111000100110101110101010111000101110001110101111100101100000101110011010101010101000111110101111101101100110011110110010001001101101001110010011100100 e4e486eb9da8eafda8e78d84eafbb0ec8991e9b89aeab8b8ebe582e6aaa3ebed99ec89b4e4e4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)