Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 趙丞渕遲堤э鬘疲ア栲趙丞渕遲堤э鬘疲ア桀^ 111001101110001010001111111001011001111110111010111001111010110110010010111001111000010010001111111010011010000110010100111001101011000110011110011111011110011011100010100011111110010110011111101110101110011110101101100100101110011110000100100011111110100110100001100101001110011010110001100111100111101101011110 e6e28fe59fbae7ad92e7848fe9a194e6b19e7de6e28fe59fbae7ad92e7848fe9a194e6b19e7b5e
EUC-JP 趙丞渕遲堤э鬘疲ア栲趙丞渕遲堤э鬘疲ア桀^ 1110110011100100101111101110011111011110101111001110111010101111110001001110100110100111111011111111001010100011110010001110100010001110101100011101101111011110111011001110010010111110111001111101111010111100111011101010111111000100111010011010011111101111111100101010001111001000111010001000111010110001110110111101110001011110 ece4bee7debceeafc4e9a7eff2a3c8e88eb1dbdeece4bee7debceeafc4e9a7eff2a3c8e88eb1dbdc5e
UTF-8 趙丞渕遲堤э鬘疲ア栲趙丞渕遲堤э鬘疲ア桀^ 1110100010110110100110011110010010111000100111101110011010111000100101011110100110000001101100101110010110100000101001001101000110001101111010011010110010011000111001111001011010110010111011111011110110110001111001101010000010110010111010001011011010011001111001001011100010011110111001101011100010010101111010011000000110110010111001011010000010100100110100011000110111101001101011001001100011100111100101101011001011101111101111011011000111100110101000011000000001011110 e8b699e4b89ee6b895e981b2e5a0a4d18de9ac98e796b2efbdb1e6a0b2e8b699e4b89ee6b895e981b2e5a0a4d18de9ac98e796b2efbdb1e6a1805e
UHC 趙丞?遲堤э?疲??趙丞?遲堤э?疲?桀^ 11110000111000011110001110101010001111111111001011000000111100001010011110101100111011110011111111111001101010100011111100111111111100001110000111100011101010100011111111110010110000001111000010100111101011001110111100111111111110011010101000111111110010111111101001011110 f0e1e3aa3ff2c0f0a7acef3ff9aa3f3ff0e1e3aa3ff2c0f0a7acef3ff9aa3fcbfa5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
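
The conversion in the table above can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes Python's "cp932" codec is an acceptable stand-in for SJIS-Win and "cp949" for UHC, and that characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 and UHC rows. The names CHARSETS, encode_row and convert are hypothetical.

```python
# Map the labels used in the table to Python codec names.
# Assumption: cp932 approximates SJIS-Win and cp949 approximates UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return one row per charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("あ^").items():
        print(label, bits, hexstr)
```

Passing a string that was already decoded with the wrong charset reproduces the garbled characters themselves, so the resulting bit strings describe the mojibake rather than the originally intended text.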