To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???宜?ぜ松??域??誼??巡??沃??B 0011111100111111001111111000101101011000001111111000001010111010100011111011110000111111001111111000100011100110001111110011111110001011011000100011111100111111100011111000010000111111001111111001011110000000001111110011111101000010 3f3f3f8b583f82ba8fbc3f3f88e63f3f8b623f3f8f843f3f97803f3f42
EUC-JP ???宜?ぜ松??域??誼??巡??沃??B 0011111100111111001111111011010110111001001111111010010010111100101111101011111000111111001111111011000011101000001111110011111110110101110000110011111100111111101111011110010000111111001111111100110111100000001111110011111101000010 3f3f3fb5b93fa4bcbebe3f3fb0e83f3fb5c33f3fbde43f3fcde03f3f42
UTF-8 嶺뚮뿫宜배ぜ松쎌춷域㏓벡誼뗥슖巡볥룂沃쇳룞B 11101111101001101010101111101011100110101010111011101011101111111010101111100101101011101001110011101011101100001011000011100011100000011001110011100110100111011011111011101100100011101000110011101100101101101011011111100101100111111001111111100011100011111001001111101011101100101010000111101000101010101011110011101011100101111010010111101100100010101001011011100101101101111010000111101011101100111010010111101011101000111000001011100110101100101000001111101100100001111011001111101011101000111001111001000010 efa6abeb9aaeebbfabe5ae9cebb0b0e3819ce69dbeec8e8cecb6b7e59f9fe38f93ebb2a1e8aabceb97a5ec8a96e5b7a1ebb3a5eba382e6b283ec87b3eba39e42
UHC 嶺뚮뿫宜배ぜ松쎌춷域㏓벡誼뗥슖巡볥룂沃쇳룞B 11100111101011011000110011101011100101111010101111101011111100011011100111101000101010101011110011100001111001101011110111101100101011011001001111100110101101001010011111101011101110101010010011101011111111101000101111100101100110101010010111100010110111101001001111101011100011111000001111101000101010101011110011101101100011111001100101000010 e7ad8ceb97abebf1b9e8aabce1e6bdecad93e6b4a7ebbaa4ebfe8be59aa5e2de93eb8f83e8aabced8f9942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)