What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霎ー陲門ュ倩ェー鞜懈拷霎ー陲門ュ倩ェー鞜應 11101000101111101011000011101000101000101001011011100101101011011001100011101000101010101011000011101000110111111001110011100110100011011000100111101000101111101011000011101000101000101001011011100101101011011001100011101000101010101011000011101000110111111001110011100100 e8beb0e8a296e5ad98e8aab0e8df9ce68d89e8beb0e8a296e5ad98e8aab0e8df9ce4
EUC-JP 霎ー陲門ュ倩ェー鞜懈拷霎ー陲門ュ倩ェー鞜應 111100001100000010001110101100001111000010100100110011001110011110001110101011011101000011101010100011101010101010001110101100001111000011100001110110001110100010111001111010011111000011000000100011101011000011110000101001001100110011100111100011101010110111010000111010101000111010101010100011101011000011110000111000011101100011100110 f0c08eb0f0a4cce78eadd0ea8eaa8eb0f0e1d8e8b9e9f0c08eb0f0a4cce78eadd0ea8eaa8eb0f0e1d8e6
UTF-8 霎ー陲門ュ倩ェー鞜懈拷霎ー陲門ュ倩ェー鞜應 111010011001110010001110111011111011110110110000111010011001100110110010111010011001011010000000111011111011110110101101111001011000000010101001111011111011110110101010111011111011110110110000111010011001111010011100111001101000011110001000111001101000101110110111111010011001110010001110111011111011110110110000111010011001100110110010111010011001011010000000111011111011110110101101111001011000000010101001111011111011110110101010111011111011110110110000111010011001111010011100111001101000011110001001 e99c8eefbdb0e999b2e99680efbdade580a9efbdaaefbdb0e99e9ce68788e68bb7e99c8eefbdb0e999b2e99680efbdade580a9efbdaaefbdb0e99e9ce68789
UHC ???門?????懈拷???門?????應 0011111100111111001111111101101010100110001111110011111100111111001111110011111111111010101010111100110110111000001111110011111100111111110110101010011000111111001111110011111100111111001111111110101111101011 3f3f3fdaa63f3f3f3f3ffaabcdb83f3f3fdaa63f3f3f3f3febeb

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
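
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters a charset cannot represent are replaced with `?` (0x3f), which mirrors the ISO-8859-1 row above, but a different converter may handle such characters differently.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`."""
    # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode `text` in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset for a sample character.
    for label, (bits, hex_string) in encode_table("\u3042").items():
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.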