To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????[BF 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010110110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5b4246
SJIS-WIN 渦??擁?????渦??擁?????[BF 10001001010100010011111100111111100101110110100100111111001111110011111100111111001111111000100101010001001111110011111110010111011010010011111100111111001111110011111100111111010110110100001001000110 89513f3f97693f3f3f3f3f89513f3f97693f3f3f3f3f5b4246
EUC-JP 渦??擁?????渦??擁?????[BF 10110001101100100011111100111111110011011100101000111111001111110011111100111111001111111011000110110010001111110011111111001101110010100011111100111111001111110011111100111111010110110100001001000110 b1b23f3fcdca3f3f3f3f3fb1b23f3fcdca3f3f3f3f3f5b4246
UTF-8 渦욕뎴擁녕떥掠욂퓘渦욕뎴擁녕떥掠욆쥤[BF 111001101011100010100110111011001001101010010101111010111000111010110100111001101001001110000001111010111000010110010101111010111001011010100101111011111010010110110101111011001001101010000010111011011001001110011000111001101011100010100110111011001001101010010101111010111000111010110100111001101001001110000001111010111000010110010101111010111001011010100101111011111010010110110101111011001001101010000110111011001010010110100100010110110100001001000110 e6b8a6ec9a95eb8eb4e69381eb8595eb96a5efa5b5ec9a82ed9398e6b8a6ec9a95eb8eb4e69381eb8595eb96a5efa5b5ec9a86eca5a45b4246
UHC 渦욕뎴擁녕떥掠욂퓘渦욕뎴擁녕떥掠욆쥤[BF 111010001011111010111111111001011000100110000111111010001011011010110011111001111000101110111000111001011011000110011110111001001011111110000011111010001011111010111111111001011000100110000111111010001011011010110011111001111000101110111000111001011011000110011110111010001010001010010110010110110100001001000110 e8bebfe58987e8b6b3e78bb8e5b19ee4bf83e8bebfe58987e8b6b3e78bb8e5b19ee8a2965b4246

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)