What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????伎???猿???????ぜ?? 001111110011111100111111001111110011111100111111100010101110101000111111001111110011111110001001100011100011111100111111001111110011111100111111001111110011111110000010101110100011111100111111 3f3f3f3f3f3f8aea3f3f3f898e3f3f3f3f3f3f3f82ba3f3f
EUC-JP ???靷??伎彛??猿?????沅?ぜ庾? 0011111100111111001111111000111111100111101111010011111100111111101101001110110010001111101111001111101000111111001111111011000111101110001111110011111100111111001111110011111110001111110001101110100100111111101001001011110010001111101111001100111000111111 3f3f3f8fe7bd3f3fb4ec8fbcfa3f3fb1ee3f3f3f3f3f8fc6e93fa4bc8fbcce3f
UTF-8 嶺뚮뿫靷뗥첎伎彛볣뇡猿됯컮嶺뚮뿭沅좄ぜ庾쥲 111011111010011010101011111010111001101010101110111010111011111110101011111010011001110110110111111010111001011110100101111011001011001010001110111001001011110010001110111001011011110110011011111010111011001110100011111010111000011110100001111001111000110010111111111010111001000010101111111011001011101110101110111011111010011010101011111010111001101010101110111010111011111110101101111001101011001010000101111011001010001010000100111000111000000110011100111001011011101010111110111011001010010110110010 efa6abeb9aaeebbfabe99db7eb97a5ecb28ee4bc8ee5bd9bebb3a3eb87a1e78cbfeb90afecbbaeefa6abeb9aaeebbfade6b285eca284e3819ce5babeeca5b2
UHC 嶺뚮뿫靷뗥첎伎彛볣뇡猿됯컮嶺뚮뿭沅좄ぜ庾쥲 111001111010110110001100111010111001011110101011111011001110011010001011111001011010101010011011110100001110101111101100101011011001001111101001100001111000100111101010101110111000100111101010101100001001010011100111101011011000110011101011100101111010110111101010101101101010000011101000101010101011110011101010111011001010001101000010 e7ad8ceb97abece68be5aa9bd0ebecad93e98789eabb89eab094e7ad8ceb97adeab6a0e8aabceaeca342

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
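
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation; the function name `encode_table` is hypothetical, and the Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumed equivalents of the charset labels above. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is the behavior the ISO-8859-1 row suggests.

```python
# Sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.
# Assumption: these Python codec names correspond to the labels in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return a list of (label, bit string, hex string) rows for `text`."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


for label, bits, hex_string in encode_table("ぜ"):
    print(label, bits, hex_string)
```

For the single character "ぜ", this should give 82ba for SJIS-WIN, a4bc for EUC-JP, and e3819c for UTF-8, the same byte values that appear for that character in the table rows above.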