To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??椅ф?以?$源??倭??悠??怨?? 111000011001111100111111001111111000100011010110100001001000011000111111100010001100100000111111100000011001000010001100101110010011111100111111100110000110000000111111001111111001011101001001001111110011111110001001100001010011111100111111 e19f3f3f88d684863f88c83f81908cb93f3f98603f3f97493f3f89853f3f
EUC-JP 癲??椅ф?以?$源??倭??悠??怨?? 111000101010000100111111001111111011000011011000101001111110011000111111101100001100101000111111101000011111000010111000101110110011111100111111110011111100000100111111001111111100110110101010001111110011111110110001111001010011111100111111 e2a13f3fb0d8a7e63fb0ca3fa1f0b8bb3f3fcfc13f3fcdaa3f3fb1e53f3f
UTF-8 癲ㅻ슡椅ф뤃以잌$源놁쪣倭몃쓣悠낂춳怨몃뼹 1110011110011001101100101110001110000101101110111110110010001010101000011110011010100100100001011101000110000100111010111010010010000011111001001011101110100101111011001001111010001100111011111011110010000100111001101011101010010000111010111000011010000001111011001010101010100011111001011000000010101101111010111010101010000011111011001001001110100011111001101000001010100000111010111000001010000010111011001011011010110011111001101000000010101000111010111010101010000011111010111011110010111001 e799b2e385bbec8aa1e6a485d184eba483e4bba5ec9e8cefbc84e6ba90eb8681ecaaa3e580adebaa83ec93a3e682a0eb8282ecb6b3e680a8ebaa83ebbcb9
UHC 癲ㅻ슡椅ф뤃以잌$源놁쪣倭몃쓣悠낂춳怨몃뼹 111011111010011010100100111010111001101010101101111010111111010110101100111001101000111110110100111011001010010010011111111001011010001110100100111010101011100110000110111011001010010110011100111010001101111010111000111010111001110110000100111010101110110110000101111010011010110110001111111010101011001110111000111010111001011010111100 efa6a4eb9aadebf5ace68fb4eca49fe5a3a4eab986eca59ce8deb8eb9d84eaed85e9ad8feab3b8eb96bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)