To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 猥???畑??意?}猥???畑??意?{^ 111000001100111000111111001111110011111110010100101010000011111100111111100010001101001100111111011111011110000011001110001111110011111100111111100101001010100000111111001111111000100011010011001111110111101101011110 e0ce3f3f3f94a83f3f88d33f7de0ce3f3f3f94a83f3f88d33f7b5e
EUC-JP 猥???畑??意?}猥???畑??意?{^ 111000001101000000111111001111110011111111001000101010100011111100111111101100001101010100111111011111011110000011010000001111110011111100111111110010001010101000111111001111111011000011010101001111110111101101011110 e0d03f3f3fc8aa3f3fb0d53f7de0d03f3f3fc8aa3f3fb0d53f7b5e
UTF-8 猥롋꾧덩畑띕툍意뱊}猥롋꾧덩畑띕툍意뱊{^ 111001111000110010100101111010111010000110001011111010101011111010100111111010111000110110101001111001111001010110010001111010111001110110010101111011011000100010001101111001101000010010001111111010111011000110001010011111011110011110001100101001011110101110100001100010111110101010111110101001111110101110001101101010011110011110010101100100011110101110011101100101011110110110001000100011011110011010000100100011111110101110110001100010100111101101011110 e78ca5eba18beabea7eb8da9e79591eb9d95ed888de6848febb18a7de78ca5eba18beabea7eb8da9e79591eb9d95ed888de6848febb18a7b5e
UHC 猥롋꾧덩畑띕툍意뱊}猥롋꾧덩畑띕툍意뱊{^ 111010001110010110001110110100011000010011101010101101011010001011101111101001011011011011101011101110001000010111101011111100101001001101101110011111011110100011100101100011101101000110000100111010101011010110100010111011111010010110110110111010111011100010000101111010111111001010010011011011100111101101011110 e8e58ed184eab5a2efa5b6ebb885ebf2936e7de8e58ed184eab5a2efa5b6ebb885ebf2936e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)