What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 汁?刀梓??蒸???汁?刀梓??蒸???^ 1000111101100000001111111001001110000001100010001011001000111111001111111000111111110110001111110011111100111111100011110110000000111111100100111000000110001000101100100011111100111111100011111111011000111111001111110011111101011110 8f603f938188b23f3f8ff63f3f3f8f603f938188b23f3f8ff63f3f3f5e
EUC-JP 汁?刀梓??蒸???汁?刀梓??蒸???^ 1011110111000001001111111100010111100001101100001011010000111111001111111011111011111000001111110011111100111111101111011100000100111111110001011110000110110000101101000011111100111111101111101111100000111111001111110011111101011110 bdc13fc5e1b0b43f3fbef83f3f3fbdc13fc5e1b0b43f3fbef83f3f3f5e
UTF-8 汁흙刀梓띕웃蒸얗렒돔汁흙刀梓띕웃蒸얗렒돔^ 11100110101100011000000111101101100111011001100111100101100010001000000011100110101000101001001111101011100111011001010111101100100110111000001111101000100100101011100011101100100101101001011111101011101000001001001011101011100011111001010011100110101100011000000111101101100111011001100111100101100010001000000011100110101000101001001111101011100111011001010111101100100110111000001111101000100100101011100011101100100101101001011111101011101000001001001011101011100011111001010001011110 e6b181ed9d99e58880e6a293eb9d95ec9b83e892b8ec9697eba092eb8f94e6b181ed9d99e58880e6a293eb9d95ec9b83e892b8ec9697eba092eb8f945e
UHC 汁흙刀梓띕웃蒸얗렒돔汁흙刀梓띕웃蒸얗렒돔^ 1111000111110000110010001110101111010011111011111110111010101001101101101110101110111111111101001111000111111010101111101110100110001110101001111011010110111100111100011111000011001000111010111101001111101111111011101010100110110110111010111011111111110100111100011111101010111110111010011000111010100111101101011011110001011110 f1f0c8ebd3efeea9b6ebbff4f1fabee98ea7b5bcf1f0c8ebd3efeea9b6ebbff4f1fabee98ea7b5bc5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
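
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions about which Python codecs correspond to the table's charset labels, and `encode_row` and `encode_table` are hypothetical helper names. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `?` entries in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return raw.decode(codec), binary, raw.hex()


def encode_table(text):
    """Encode the same text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print only the ASCII-safe columns (binary and hex) for a short sample.
    for label, (_shown, binary, hexa) in encode_table("汁^").items():
        print(label, binary, hexa)
```

Encoding the single character `^` yields `5e` in every charset listed, which is why each row of the table ends in `5e`.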