To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????Uh???????????U 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010101101000001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010101 3f3f3f3f3f3f3f3f3f3f3f55683f3f3f3f3f3f3f3f3f3f3f55
SJIS-WIN 霎ー陲門ュ倩ェー鞜懈純Uh霎ー陲門ュ倩ェー鞜懈純U 111010001011111010110000111010001010001010010110111001011010110110011000111010001010101010110000111010001101111110011100111001101000111110000011010101010110100011101000101111101011000011101000101000101001011011100101101011011001100011101000101010101011000011101000110111111001110011100110100011111000001101010101 e8beb0e8a296e5ad98e8aab0e8df9ce68f835568e8beb0e8a296e5ad98e8aab0e8df9ce68f8355
EUC-JP 霎ー陲門ュ倩ェー鞜懈純Uh霎ー陲門ュ倩ェー鞜懈純U 1111000011000000100011101011000011110000101001001100110011100111100011101010110111010000111010101000111010101010100011101011000011110000111000011101100011101000101111011110001101010101011010001111000011000000100011101011000011110000101001001100110011100111100011101010110111010000111010101000111010101010100011101011000011110000111000011101100011101000101111011110001101010101 f0c08eb0f0a4cce78eadd0ea8eaa8eb0f0e1d8e8bde35568f0c08eb0f0a4cce78eadd0ea8eaa8eb0f0e1d8e8bde355
UTF-8 霎ー陲門ュ倩ェー鞜懈純Uh霎ー陲門ュ倩ェー鞜懈純U 111010011001110010001110111011111011110110110000111010011001100110110010111010011001011010000000111011111011110110101101111001011000000010101001111011111011110110101010111011111011110110110000111010011001111010011100111001101000011110001000111001111011010010010100010101010110100011101001100111001000111011101111101111011011000011101001100110011011001011101001100101101000000011101111101111011010110111100101100000001010100111101111101111011010101011101111101111011011000011101001100111101001110011100110100001111000100011100111101101001001010001010101 e99c8eefbdb0e999b2e99680efbdade580a9efbdaaefbdb0e99e9ce68788e7b4945568e99c8eefbdb0e999b2e99680efbdade580a9efbdaaefbdb0e99e9ce68788e7b49455
UHC ???門?????懈純Uh???門?????懈純U 00111111001111110011111111011010101001100011111100111111001111110011111100111111111110101010101111100010111011010101010101101000001111110011111100111111110110101010011000111111001111110011111100111111001111111111101010101011111000101110110101010101 3f3f3fdaa63f3f3f3f3ffaabe2ed55683f3f3fdaa63f3f3f3f3ffaabe2ed55

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)