To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 橈??油?㎝鎖??艶j?淫??誘④?? 10011110111101000011111100111111100101101111101100111111100001110111000010001101101111010011111100111111100010011001000010000010100010100011111110001000111110100011111100111111100101110101010110000111010000110011111100111111 9ef43f3f96fb3f87708dbd3f3f8990828a3f88fa3f3f975587433f3f
EUC-JP 橈??油??鎖??艶j?淫??誘??? 1101110011110110001111110011111111001100111111010011111100111111101110101011111100111111001111111011000111110000101000111110101000111111101100001111110000111111001111111100110110110110001111110011111100111111 dcf63f3fccfd3f3fbabf3f3fb1f0a3ea3fb0fc3f3fcdb63f3f3f
UTF-8 橈볥굝油꾬㎝鎖듬쨨艶j쑨淫앯춯誘④뻗歷 111001101010100110001000111010111011001110100101111010101011010110011101111001101011001010111001111010101011111010101100111000111000111010011101111010011000111010010110111010111001001110101100111011001010100010101000111010001000100110110110111011111011110110001010111011001001000110101000111001101011011110101011111011001001010110101111111011001011011010101111111010001010101010011000111000101001000110100011111010111011101110010111111011111010011010001100 e6a988ebb3a5eab59de6b2b9eabeace38e9de98e96eb93aceca8a8e889b6efbd8aec91a8e6b7abec95afecb6afe8aa98e291a3ebbb97efa68c
UHC 橈볥굝油꾬㎝鎖듬쨨艶j쑨淫앯춯誘④뻗歷 1110100011111010100100111110101110000010100001011110101011111010100001001110111110100111101011111110000111110000101101011110101110100100100000111110011011111101101000111110101010111110101001111110101111100010100111011110011110101101100011001110101110101111101010001110101010111011101110001110011010111000 e8fa93eb8285eafa84efa7afe1f0b5eba483e6fda3eabea7ebe29de7ad8cebafa8eabbb8e6b8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)