Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????U 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f55
SJIS-WIN 鳶??言?鳶??言?鳶??言?鳶??言?U 1001001111001110001111110011111110001100101111100011111110010011110011100011111100111111100011001011111000111111100100111100111000111111001111111000110010111110001111111001001111001110001111110011111110001100101111100011111101010101 93ce3f3f8cbe3f93ce3f3f8cbe3f93ce3f3f8cbe3f93ce3f3f8cbe3f55
EUC-JP 鳶??言?鳶??言?鳶??言?鳶??言?U 1100011011010000001111110011111110111000110000000011111111000110110100000011111100111111101110001100000000111111110001101101000000111111001111111011100011000000001111111100011011010000001111110011111110111000110000000011111101010101 c6d03f3fb8c03fc6d03f3fb8c03fc6d03f3fb8c03fc6d03f3fb8c03f55
UTF-8 鳶멩뿦言뎉鳶멩뿦言뎆鳶멩뿦言덾鳶멩뿦言덵U 11101001101100111011011011101011101010011010100111101011101111111010011011101000101010001000000011101011100011101000100111101001101100111011011011101011101010011010100111101011101111111010011011101000101010001000000011101011100011101000011011101001101100111011011011101011101010011010100111101011101111111010011011101000101010001000000011101011100011011011111011101001101100111011011011101011101010011010100111101011101111111010011011101000101010001000000011101011100011011011010101010101 e9b3b6eba9a9ebbfa6e8a880eb8e89e9b3b6eba9a9ebbfa6e8a880eb8e86e9b3b6eba9a9ebbfa6e8a880eb8dbee9b3b6eba9a9ebbfa6e8a880eb8db555
UHC 鳶멩뿦言뎉鳶멩뿦言뎆鳶멩뿦言덾鳶멩뿦言덵U 1110011011101001101110001110011010010111101001101110010111101011100010010101011111100110111010011011100011100110100101111010011011100101111010111000100101010100111001101110100110111000111001101001011110100110111001011110101110001001010100011110011011101001101110001110011010010111101001101110010111101011100010010100100101010101 e6e9b8e697a6e5eb8957e6e9b8e697a6e5eb8954e6e9b8e697a6e5eb8951e6e9b8e697a6e5eb894955

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
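
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the converter's actual implementation: the function name `encode_in_charsets` is hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {charset label: (binary bit string, hexadecimal bit string)}."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_in_charsets("言U").items():
        print(label, binary, hexadecimal)
```

For the input `言U`, this sketch yields `e8a88055` for UTF-8, `8cbe55` for SJIS-WIN, and `3f55` for ISO-8859-1, in line with the byte sequences for 言 and U in the table. Other converters may handle unmappable characters differently.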