Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 腸?億??????刻腸?億???????? 1001001010110000001111111000100110101101001111110011111100111111001111110011111100111111100011011000111110010010101100000011111110001001101011010011111100111111001111110011111100111111001111110011111100111111 92b03f89ad3f3f3f3f3f3f8d8f92b03f89ad3f3f3f3f3f3f3f3f
EUC-JP 腸?億??????刻腸?億???????? 1100010010110010001111111011001010101111001111110011111100111111001111110011111100111111101110011110111111000100101100100011111110110010101011110011111100111111001111110011111100111111001111110011111100111111 c4b23fb2af3f3f3f3f3f3fb9efc4b23fb2af3f3f3f3f3f3f3f3f
UTF-8 腸렑億또뀜렒띤렗뀄刻腸렑億또뀜렒띤렗곡렰굻 111010001000010110111000111010111010000010010001111001011000010010000100111010111001100010010000111010111000000010011100111010111010000010010010111010111001110110100100111010111010000010010111111010111000000010000100111001011000100010111011111010001000010110111000111010111010000010010001111001011000010010000100111010111001100010010000111010111000000010011100111010111010000010010010111010111001110110100100111010111010000010010111111010101011001110100001111010111010000010110000111010101011010110111011 e885b8eba091e58484eb9890eb809ceba092eb9da4eba097eb8084e588bbe885b8eba091e58484eb9890eb809ceba092eb9da4eba097eab3a1eba0b0eab5bb
UHC 腸렑億또뀜렒띤렗뀄刻腸렑億또뀜렒띤렗곡렰굻 111011011111001110001110101001101110010111100010101101101100011110110010111100011000111010100111101101101110110110001110101011001011001011101101110010101011111011101101111100111000111010100110111001011110001010110110110001111011001011110001100011101010011110110110111011011000111010101100101100001110111010001110101111011011000110111111 edf38ea6e5e2b6c7b2f18ea7b6ed8eacb2edcabeedf38ea6e5e2b6c7b2f18ea7b6ed8eacb0ee8ebdb1bf
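
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the page's actual implementation: the mapping from the table's charset labels to Python codec names (`CODECS`) and the helper names `encode_row` and `convert` are assumptions made for illustration. It also assumes that characters a charset cannot represent are replaced by "?" (0x3f), which is what the ISO-8859-1 row suggests.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # UHC is exposed as cp949 in Python
}


def encode_row(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`."""
    # Characters the codec cannot represent become "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Build one (bits, hex) pair per charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexed) in convert("腸").items():
        print(label, bits, hexed)
```

For the single character 腸, this should reproduce the leading bytes of each row above, for example `e885b8` for UTF-8 and `3f` for ISO-8859-1.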

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
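
Because the two Japanese charsets assign different bytes to the same character, a byte sequence only decodes correctly with the charset that produced it. The sketch below illustrates this with the bytes for 腸 taken from the table; `decode_hex` is a hypothetical helper, and the Python codec names `cp932` and `euc_jp` are assumed to correspond to SJIS-Win and EUC-JP.

```python
def decode_hex(hex_string, codec):
    """Decode a hexadecimal byte string using the given codec."""
    # Invalid byte sequences are replaced rather than raising an error.
    return bytes.fromhex(hex_string).decode(codec, errors="replace")


if __name__ == "__main__":
    print(decode_hex("92b0", "cp932"))   # SJIS-Win bytes read as CP932
    print(decode_hex("c4b2", "euc_jp"))  # EUC-JP bytes read as EUC-JP
    print(decode_hex("c4b2", "cp932"))   # EUC-JP bytes misread as CP932
```

The first two calls recover the original character, while the third yields different text because the bytes are interpreted under the wrong charset.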