To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???巡??銀??碍??悠?┏遺????~ 00111111001111110011111110001111100001000011111100111111100010111110001000111111001111111000101001010110001111110011111110010111010010010011111110000100101011001000100011100010001111110011111100111111001111111000000101100000 3f3f3f8f843f3f8be23f3f8a563f3f97493f84ac88e23f3f3f3f8160
EUC-JP ???巡??銀??碍??悠?┏遺????〜 00111111001111110011111110111101111001000011111100111111101101101110010000111111001111111011001110110111001111110011111111001101101010100011111110101000101011101011000011100100001111110011111100111111001111111010000111000001 3f3f3fbde43f3fb6e43f3fb3b73f3fcdaa3fa8aeb0e43f3f3f3fa1c1
UTF-8 樂낅뜄巡곭뙴銀㏓븶碍⑸쵐悠득┏遺얜굜連얠~ 111011111010011010111111111010111000001010000101111010111001110010000100111001011011011110100001111010101011001110101101111010111001100110110100111010011000101010000000111000111000111110010011111010111011100010110110111001111010001010001101111000101001000110111000111011001011010110010000111001101000001010100000111010111001001110011101111000101001010010001111111010011000000110111010111011001001011010011100111010101011010110011100111011111010011010011010111011001001011010100000111011111011110110011110 efa6bfeb8285eb9c84e5b7a1eab3adeb99b4e98a80e38f93ebb8b6e7a28de291b8ecb590e682a0eb939de2948fe981baec969ceab59cefa69aec96a0efbd9e
UHC 樂낅뜄巡곭뙴銀㏓븶碍⑸쵐悠득┏遺얜굜連얠~ 111010001111100110000101111010111000110110001000111000101101111010000001111001111000110010110111111010111101111010100111111010111001010110011111111001001111010010101001111010111010110010010010111010101110110110110101111001101010011010101110111010111011011010111110111010111000001010000100111001101110011010111110111011001010001010100110 e8f985eb8d88e2de81e78cb7ebdea7eb959fe4f4a9ebac92eaedb5e6a6aeebb6beeb8284e6e6beeca2a6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)