What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??猷??儒??語⑥?異??純???ο?悠 11100001100111110011111100111111100101110101000100111111001111111000111011110010001111110011111110001100111010101000011101000101001111111000100011011001001111110011111110001111100000110011111100111111001111111000001111001101001111111001011101001001 e19f3f3f97513f3f8ef23f3f8cea87453f88d93f3f8f833f3f3f83cd3f9749
EUC-JP 癲??猷??儒??語??異??純???ο?悠 111000101010000100111111001111111100110110110010001111110011111110111100111101000011111100111111101110001110110000111111001111111011000011011011001111110011111110111101111000110011111100111111001111111010011011001111001111111100110110101010 e2a13f3fcdb23f3fbcf43f3fb8ec3f3fb0db3f3fbde33f3f3fa6cf3fcdaa
UTF-8 癲숆낄猷쀦껸儒룸눤語⑥궡異멨윜純볦벞若ο퐟悠 1110011110011001101100101110110010001000100001101110101110000010100001001110011110001100101101111110110010000000101001101110101010111011101110001110010110000100100100101110101110100011101110001110101110001000101001001110100010101010100111101110001010010001101001011110101010110110101000011110011110010101101100001110101110101001101010001110110010011100100111001110011110110100100101001110101110110011101001101110101110110010100111101110111110100101101101001100111010111111111011011001000010011111111001101000001010100000 e799b2ec8886eb8284e78cb7ec80a6eabbb8e58492eba3b8eb88a4e8aa9ee291a5eab6a1e795b0eba9a8ec9c9ce7b494ebb3a6ebb29eefa5b4cebfed909fe682a0
UHC 癲숆낄猷쀦껸儒룸눤語⑥궡異멨윜純볦벞若ο퐟悠 1110111110100110100110011110101010110011101001011110101110100011100101111110011010110010101110011110101011100011101101111110101110000111101110111110010111011110101010001110110010000010101101001110110010110110101110001110010110011111100111111110001011101101100100111110110010010011101110011110010110101110101001011110111110111101100010001110101011101101 efa699eab3a5eba397e6b2b9eae3b7eb87bbe5dea8ec82b4ecb6b8e59f9fe2ed93ec93b9e5aea5efbd88eaed

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
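
The conversion shown in the table above can be approximated in a few lines of Python. This is only a sketch and not the code behind this page. The codec names (latin-1, cp932, euc_jp, utf-8, cp949) are assumptions chosen as the closest Python equivalents of the charset labels, and the helper names encode_row and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-win": "cp932",   # Microsoft code page 932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code (Windows code page 949)
}


def encode_row(text, codec):
    """Encode text with codec and return (binary string, hex string).

    Unencodable characters become b"?" (0x3f) instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("あ").items():
        print(name, binary, hexadecimal)
```

For example, the hiragana "あ" encodes to e38182 in UTF-8, 82a0 in CP932, and a4a2 in EUC-JP, while ISO-8859-1 has no such character and yields the replacement byte 3f.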