To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN ?????ぜ???猥??議??柔ロ?沃??h 00111111001111110011111100111111001111111000001010111010001111110011111100111111111000001100111000111111001111111000101101100011001111110011111110001111010111111000001110001101001111111001011110000000001111110011111101101000 3f3f3f3f3f82ba3f3f3fe0ce3f3f8b633f3f8f5f838d3f97803f3f68
EUC-JP ???沅?ぜ???猥??議??柔ロ?沃??h 001111110011111100111111100011111100011011101001001111111010010010111100001111110011111100111111111000001101000000111111001111111011010111000100001111110011111110111101110000001010010111101101001111111100110111100000001111110011111101101000 3f3f3f8fc6e93fa4bc3f3f3fe0d03f3fb5c43f3fbdc0a5ed3fcde03f3f68
UTF-8 嶺뚮뿭沅좄ぜ流껋춷猥됰씮議롧춯柔ロ닑沃쇰뿿h 11101111101001101010101111101011100110101010111011101011101111111010110111100110101100101000010111101100101000101000010011100011100000011001110011101111101001111000101011101010101110111000101111101100101101101011011111100111100011001010010111101011100100001011000011101100100101001010111011101000101011011011000011101011101000011010011111101100101101101010111111100110100111111001010011100011100000111010110111101011100010111001000111100110101100101000001111101100100001111011000011101011101111111011111101101000 efa6abeb9aaeebbfade6b285eca284e3819cefa78aeabb8becb6b7e78ca5eb90b0ec94aee8adb0eba1a7ecb6afe69f94e383adeb8b91e6b283ec87b0ebbfbf68
UHC 嶺뚮뿭沅좄ぜ流껋춷猥됰씮議롧춯柔ロ닑沃쇰뿿h 11100111101011011000110011101011100101111010110111101010101101101010000011101000101010101011110011101010111111001000001111101100101011011001001111101000111001011000100111101011100111011011111111101100101000011000111011100111101011011000110011101010111101011010101111101101100010001001011011101000101010101011110011101011100101111011111101101000 e7ad8ceb97adeab6a0e8aabceafc83ecad93e8e589eb9dbfeca18ee7ad8ceaf5abed8896e8aabceb97bf68

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)