To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鞨∬ェケ蹊ウ驕丈驍∬ェケ迸ウ鞜丞直^ 11101000111000001000000111101000101010101011100111100110111111001011001111101001100000011000111111100100111100001011110011101001100000101000000111101000101010101011100111100111100111101011001111101000110111111000111111100101100100101011110001011110 e8e081e8aab9e6fcb3e9818fe4f0bce98281e8aab9e79eb3e8df8fe592bc5e
EUC-JP 鞨∬ェケ蹊ウ驕丈?驍∬ェケ迸ウ鞜丞直^ 111100001110001010100010111010101000111010101010100011101011100111101100111111101000111010110011111100011110000110111110111001100011111111110001111000101010001011101010100011101010101010001110101110011110110111111110100011101011001111110000111000011011111011100111110001001011111001011110 f0e2a2ea8eaa8eb9ecfe8eb3f1e1bee63ff1e2a2ea8eaa8eb9edfe8eb3f0e1bee7c4be5e
UTF-8 鞨∬ェケ蹊ウ驕丈驍∬ェケ迸ウ鞜丞直^ 11101001100111101010100011100010100010001010110011101111101111011010101011101111101111011011100111101000101110011000101011101111101111011011001111101001101010011001010111100100101110001000100011101110100000011011101111101001101010011000110111100010100010001010110011101111101111011010101011101111101111011011100111101000101111111011100011101111101111011011001111101001100111101001110011100100101110001001111011100111100110111011010001011110 e99ea8e288acefbdaaefbdb9e8b98aefbdb3e9a995e4b888ee81bbe9a98de288acefbdaaefbdb9e8bfb8efbdb3e99e9ce4b89ee79bb45e
UHC 鞨∬??蹊?驕丈?驍∬?????丞直^ 11001010111010101010000111110011001111110011111111111011101101110011111111001110111101101110110111011011001111111111110110100100101000011111001100111111001111110011111100111111001111111110001110101010111100101100000101011110 caeaa1f33f3ffbb73fcef6eddb3ffda4a1f33f3f3f3f3fe3aaf2c15e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)