What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 渦??擁?????z渦??擁?????zB 10001001010100010011111100111111100101110110100100111111001111110011111100111111001111110111101010001001010100010011111100111111100101110110100100111111001111110011111100111111001111110111101001000010 89513f3f97693f3f3f3f3f7a89513f3f97693f3f3f3f3f7a42
EUC-JP 渦??擁?????z渦??擁?????zB 10110001101100100011111100111111110011011100101000111111001111110011111100111111001111110111101010110001101100100011111100111111110011011100101000111111001111110011111100111111001111110111101001000010 b1b23f3fcdca3f3f3f3f3f7ab1b23f3fcdca3f3f3f3f3f7a42
UTF-8 渦욕뎴擁녕떥掠욄뜆z渦욕뎴擁녕떥掠욄뜆zB 111001101011100010100110111011001001101010010101111010111000111010110100111001101001001110000001111010111000010110010101111010111001011010100101111011111010010110110101111011001001101010000100111010111001110010000110011110101110011010111000101001101110110010011010100101011110101110001110101101001110011010010011100000011110101110000101100101011110101110010110101001011110111110100101101101011110110010011010100001001110101110011100100001100111101001000010 e6b8a6ec9a95eb8eb4e69381eb8595eb96a5efa5b5ec9a84eb9c867ae6b8a6ec9a95eb8eb4e69381eb8595eb96a5efa5b5ec9a84eb9c867a42
UHC 渦욕뎴擁녕떥掠욄뜆z渦욕뎴擁녕떥掠욄뜆zB 111010001011111010111111111001011000100110000111111010001011011010110011111001111000101110111000111001011011000110011110111001101000110110001001011110101110100010111110101111111110010110001001100001111110100010110110101100111110011110001011101110001110010110110001100111101110011010001101100010010111101001000010 e8bebfe58987e8b6b3e78bb8e5b19ee68d897ae8bebfe58987e8b6b3e78bb8e5b19ee68d897a42

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
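
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's standard codecs `cp932`, `euc_jp` and `cp949` are close enough stand-ins for SJIS-WIN, EUC-JP and UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), as the table suggests. The names `CHARSETS`, `encode_row` and `encode_table` are hypothetical.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: these codecs approximate the charsets used by the page.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("zB").items():
        print(name, bits, hex_string)
```

All five charsets are ASCII-compatible, so `z` and `B` come out as `7a` and `42` everywhere, which matches the trailing `7a42` in each table row. Multi-byte characters are where the charsets differ, and unsupported ones collapse to `3f`.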