To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??維??猿????????????c?誼 111000101010001100111111001111111000100011011011001111110011111110001001100011100011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111000001010000011001111111000101101100010 e2a33f3f88db3f3f898e3f3f3f3f3f3f3f3f3f3f3f3f82833f8b62
EUC-JP 筌??維??猿??倻????????渶c?誼 11100100101001010011111100111111101100001101110100111111001111111011000111101110001111110011111110001111101100011111011000111111001111110011111100111111001111110011111100111111001111111000111111000111111011011010001111100011001111111011010111000011 e4a53f3fb0dd3f3fb1ee3f3f8fb1f63f3f3f3f3f3f3f3f8fc7eda3e33fb5c3
UTF-8 筌뚭여維뽩쫿猿뗫툡倻뽮퍗留볩㎘類앸샹渶c렖誼 111001111010110110001100111010111001101010101101111011001001011110101100111001111011011010101101111010111011110110101001111011001010101110111111111001111000110010111111111010111001011110101011111011011000100010100001111001011000000010111011111010111011110110101110111011011000110110010111111011111010011110001101111010111011001110101001111000111000111010011000111011111010011110010000111011001001010110111000111011001000001110111001111001101011100010110110111011111011110110000011111010111010000010010110111010001010101010111100 e7ad8ceb9aadec97ace7b6adebbda9ecabbfe78cbfeb97abed88a1e580bbebbdaeed8d97efa78debb3a9e38e98efa790ec95b8ec83b9e6b8b6efbd83eba096e8aabc
UHC 筌뚭여維뽩쫿猿뗫툡倻뽮퍗留볩㎘類앸샹渶c렖誼 1110111110100111100011001110101010111111101010011110101110101011100101101110010110100110100101101110101010111011100010111110101110111000100110001110010110100110100101101110101010111011100011101110101110100111100100111110111110100111101001011110101110111010100111011110101110111100101001111110011110110111101000111110001110001110101010111110101111111110 efa78ceabfa9ebab96e5a696eabb8bebb898e5a696eabb8eeba793efa7a5ebba9debbca7e7b7a3e38eabebfe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)