To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??∮?功ゅ??枇ァ????∮?功ゅ??枇ァ??B 0011111100111111100001111001001100111111100011001111011110000010111000110011111100111111100101001111100010000011010000000011111100111111001111110011111110000111100100110011111110001100111101111000001011100011001111110011111110010100111110001000001101000000001111110011111101000010 3f3f87933f8cf782e33f3f94f883403f3f3f3f87933f8cf782e33f3f94f883403f3f42
EUC-JP ????功ゅ??枇ァ??????功ゅ??枇ァ??B 001111110011111100111111001111111011100011111001101001001110010100111111001111111100100011111010101001011010000100111111001111110011111100111111001111110011111110111000111110011010010011100101001111110011111111001000111110101010010110100001001111110011111101000010 3f3f3f3fb8f9a4e53f3fc8faa5a13f3f3f3f3f3fb8f9a4e53f3fc8faa5a13f3f42
UTF-8 룴햶∮룴功ゅ룴점枇ァ룵햧룴햶∮룴功ゅ룴점枇ァ룵햧B 11101011101000111011010011101101100101101011011011100010100010001010111011101011101000111011010011100101100010101001111111100011100000101000010111101011101000111011010011101100101000001001000011100110100111101000011111100011100000101010000111101011101000111011010111101101100101101010011111101011101000111011010011101101100101101011011011100010100010001010111011101011101000111011010011100101100010101001111111100011100000101000010111101011101000111011010011101100101000001001000011100110100111101000011111100011100000101010000111101011101000111011010111101101100101101010011101000010 eba3b4ed96b6e288aeeba3b4e58a9fe38285eba3b4eca090e69e87e382a1eba3b5ed96a7eba3b4ed96b6e288aeeba3b4e58a9fe38285eba3b4eca090e69e87e382a1eba3b5ed96a742
UHC 룴햶∮룴功ゅ룴점枇ァ룵햧룴햶∮룴功ゅ룴점枇ァ룵햧B 10001111101010011100000110001111101000101011000110001111101010011100110111101101101010101110010110001111101010011100000110100001110111011110110110101011101000011000111110101010110000010111101010001111101010011100000110001111101000101011000110001111101010011100110111101101101010101110010110001111101010011100000110100001110111011110110110101011101000011000111110101010110000010111101001000010 8fa9c18fa2b18fa9cdedaae58fa9c1a1ddedaba18faac17a8fa9c18fa2b18fa9cdedaae58fa9c1a1ddedaba18faac17a42

SJIS-Win, EUC-JP: Classic character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
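
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `cp932` and `cp949` codecs are close enough stand-ins for SJIS-WIN and UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), as the table rows suggest. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical and used only for illustration.

```python
# Map the labels used in the table to Python codec names.
# Assumption: cp932 approximates SJIS-WIN and cp949 approximates UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary, hexadecimal.
    for label, (bits, hexa) in encode_table("功").items():
        print(label, bits, hexa)
```

Individual mappings may differ between implementations of the same charset, so results for rare characters need not match this page exactly.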