To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 訒「閾ェ訒夲スク﨡会スス鮖ソ蠇帷朕訒、髮ォ 111110111010001110100010111010001000011110101010111110111010001110011010111011111011110110111000111110111010000010001001111011111011110110111101111010011011100110111111111110111010000110011011111001111001001010111101111110111010001110100100111010011001101110101011 fba3a2e887aafba39aefbdb8fba089efbdbde9b9bffba19be792bdfba3a4e99bab
EUC-JP 訒「閾ェ訒夲スク?会スス鮖ソ?帷朕訒、髮ォ 10001111110111011100100010001110101000101110111111100111100011101010101010001111110111011100100011010100111100011000111010111101100011101011100000111111101100101111000110001110101111011000111010111101111100101011101110001110101111110011111111010110111010011100010010111111100011111101110111001000100011101010010011110001111110111000111010101011 8fddc88ea2efe78eaa8fddc8d4f18ebd8eb83fb2f18ebd8ebdf2bb8ebf3fd6e9c4bf8fddc88ea4f1fb8eab
UTF-8 訒「閾ェ訒夲スク﨡会スス鮖ソ蠇帷朕訒、髮ォ 111010001010100010010010111011111011110110100010111010011001011010111110111011111011110110101010111010001010100010010010111001011010010010110010111011111011110110111101111011111011110110111000111011111010100010100001111001001011110010011010111011111011110110111101111011111011110110111101111010011010111010010110111011111011110110111111111010001010000010000111111001011011100010110111111001101001110010010101111010001010100010010010111011111011110110100100111010011010101110101110111011111011110110101011 e8a892efbda2e996beefbdaae8a892e5a4b2efbdbdefbdb8efa8a1e4bc9aefbdbdefbdbde9ae96efbdbfe8a087e5b8b7e69c95e8a892efbda4e9abaeefbdab
UHC ????????????????朕??髮? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111111110010111110010011111100111111110110111010010100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3ff2f93f3fdba53f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)