To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 畑??溢▽?音??碍??依??循??筌 100101001010100000111111001111111000100011101100100000011010010000111111100010011011100100111111001111111000101001010110001111110011111110001000110010110011111100111111100011110111101000111111001111111110001010100011 94a83f3f88ec81a43f89b93f3f8a563f3f88cb3f3f8f7a3f3fe2a3
EUC-JP 畑??溢▽?音??碍?Ŋ依??循??筌 1100100010101010001111110011111110110000111011101010001010100110001111111011001010111011001111110011111110110011101101110011111110001111101010011010101110110000110011010011111100111111101111011101101100111111001111111110010010100101 c8aa3f3fb0eea2a63fb2bb3f3fb3b73f8fa9abb0cd3f3fbddb3f3fe4a5
UTF-8 畑듬뿰溢▽풚音뚰뭽碍⑸Ŋ依뷴춢循뗫쐞筌 1110011110010101100100011110101110010011101011001110101110111111101100001110011010111010101000101110001010010110101111011110110110010010100110101110100110011111101100111110101110011010101100001110101110101101101111011110011110100010100011011110001010010001101110001100010110001010111001001011111010011101111010111011011110110100111011001011011010100010111001011011111010101010111010111001011110101011111011001001000010011110111001111010110110001100 e79591eb93acebbfb0e6baa2e296bded929ae99fb3eb9ab0ebadbde7a28de291b8c58ae4be9debb7b4ecb6a2e5beaaeb97abec909ee7ad8c
UHC 畑듬뿰溢▽풚音뚰뭽碍⑸Ŋ依뷴춢循뗫쐞筌 1110111110100101101101011110101110010111101100001110110011101110101000011110010010111110100111011110101111100101100011001110110110010010100011001110010011110100101010011110101110101000101011111110101111101110101110101110010110101101100000111110001011100000100010111110101110011100100001001110111110100111 efa5b5eb97b0eceea1e4be9debe58ced928ce4f4a9eba8afebeebae5ad83e2e08beb9c84efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)