What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 終戡????肄???烝?終戡????肄???烝?^ 100011110100100110011101010000010011111100111111001111110011111111100011111001010011111100111111001111111110000001111110001111111000111101001001100111010100000100111111001111110011111100111111111000111110010100111111001111110011111111100000011111100011111101011110 8f499d413f3f3f3fe3e53f3f3fe07e3f8f499d413f3f3f3fe3e53f3f3fe07e3f5e
EUC-JP 終戡??佺?肄???烝?終戡??佺?肄???烝?^ 10111101101010101101100110100010001111110011111110001111101100001111100100111111111001101110011100111111001111110011111111011111110111110011111110111101101010101101100110100010001111110011111110001111101100001111100100111111111001101110011100111111001111110011111111011111110111110011111101011110 bdaad9a23f3f8fb0f93fe6e73f3f3fdfdf3fbdaad9a23f3f8fb0f93fe6e73f3f3fdfdf3f5e
UTF-8 終戡렰렦佺렦肄폈렰렯烝쌨終戡렰렦佺렦肄폈렰렯烝쌤^ 11100111101101011000001011100110100010001010000111101011101000001011000011101011101000001010011011100100101111011011101011101011101000001010011011101000100000101000010011101101100011111000100011101011101000001011000011101011101000001010111111100111100000111001110111101100100011001010100011100111101101011000001011100110100010001010000111101011101000001011000011101011101000001010011011100100101111011011101011101011101000001010011011101000100000101000010011101101100011111000100011101011101000001011000011101011101000001010111111100111100000111001110111101100100011001010010001011110 e7b582e688a1eba0b0eba0a6e4bdbaeba0a6e88284ed8f88eba0b0eba0afe7839dec8ca8e7b582e688a1eba0b0eba0a6e4bdbaeba0a6e88284ed8f88eba0b0eba0afe7839dec8ca45e
UHC 終戡렰렦佺렦肄폈렰렯烝쌨終戡렰렦佺렦肄폈렰렯烝쌤^ 11110000111110111100101011110001100011101011110110001110101101011110111011101101100011101011010111101100101111011100011011110001100011101011110110001110101111001111000111110110101111011101111011110000111110111100101011110001100011101011110110001110101101011110111011101101100011101011010111101100101111011100011011110001100011101011110110001110101111001111000111110110101111011101110001011110 f0fbcaf18ebd8eb5eeed8eb5ecbdc6f18ebd8ebcf1f6bddef0fbcaf18ebd8eb5eeed8eb5ecbdc6f18ebd8ebcf1f6bddc5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
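
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the code behind this page: it assumes Python's built-in codecs `cp932`, `euc_jp`, and `cp949` correspond to SJIS-Win, EUC-JP, and UHC, and the names `CHARSETS`, `encode_row`, and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (3f), which matches the runs of 3f in the table above; individual mappings may still differ from this page in edge cases.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("終^").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.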