To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霎ー陲門ュ倩ェー鞜懈拷霎ー陲門ュ倩ェー鞜應ソ 1110100010111110101100001110100010100010100101101110010110101101100110001110100010101010101100001110100011011111100111001110011010001101100010011110100010111110101100001110100010100010100101101110010110101101100110001110100010101010101100001110100011011111100111001110010010111111 e8beb0e8a296e5ad98e8aab0e8df9ce68d89e8beb0e8a296e5ad98e8aab0e8df9ce4bf
EUC-JP 霎ー陲門ュ倩ェー鞜懈拷霎ー陲門ュ倩ェー鞜應ソ 1111000011000000100011101011000011110000101001001100110011100111100011101010110111010000111010101000111010101010100011101011000011110000111000011101100011101000101110011110100111110000110000001000111010110000111100001010010011001100111001111000111010101101110100001110101010001110101010101000111010110000111100001110000111011000111001101000111010111111 f0c08eb0f0a4cce78eadd0ea8eaa8eb0f0e1d8e8b9e9f0c08eb0f0a4cce78eadd0ea8eaa8eb0f0e1d8e68ebf
UTF-8 霎ー陲門ュ倩ェー鞜懈拷霎ー陲門ュ倩ェー鞜應ソ 111010011001110010001110111011111011110110110000111010011001100110110010111010011001011010000000111011111011110110101101111001011000000010101001111011111011110110101010111011111011110110110000111010011001111010011100111001101000011110001000111001101000101110110111111010011001110010001110111011111011110110110000111010011001100110110010111010011001011010000000111011111011110110101101111001011000000010101001111011111011110110101010111011111011110110110000111010011001111010011100111001101000011110001001111011111011110110111111 e99c8eefbdb0e999b2e99680efbdade580a9efbdaaefbdb0e99e9ce68788e68bb7e99c8eefbdb0e999b2e99680efbdade580a9efbdaaefbdb0e99e9ce68789efbdbf
UHC ???門?????懈拷???門?????應? 001111110011111100111111110110101010011000111111001111110011111100111111001111111111101010101011110011011011100000111111001111110011111111011010101001100011111100111111001111110011111100111111111010111110101100111111 3f3f3fdaa63f3f3f3f3ffaabcdb83f3f3fdaa63f3f3f3f3febeb3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)