To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霆?鬱?齬絅??姐?謎鬱?齬絅??姐? 1110100010111011001111111001111101010100001111111110101010010111111000110100010000111111001111111000100010110111001111111001001111100100100111110101010000111111111010101001011111100011010001000011111100111111100010001011011100111111 e8bb3f9f543fea97e3443f3f88b73f93e49f543fea97e3443f3f88b73f
EUC-JP 霆?鬱?齬絅??姐?謎鬱?齬絅??姐? 1111000010111101001111111101110110110101001111111111001111110111111001011010010100111111001111111011000010111001001111111100011011100110110111011011010100111111111100111111011111100101101001010011111100111111101100001011100100111111 f0bd3fddb53ff3f7e5a53f3fb0b93fc6e6ddb53ff3f7e5a53f3fb0b93f
UTF-8 霆렪鬱렏齬絅렋렡姐흘謎鬱렏齬絅렋렡姐퓻 111010011001110010000110111010111010000010101010111010011010110010110001111010111010000010001111111010011011110110101100111001111011010110000101111010111010000010001011111010111010000010100001111001011010011110010000111011011001110110011000111010001010110010001110111010011010110010110001111010111010000010001111111010011011110110101100111001111011010110000101111010111010000010001011111010111010000010100001111001011010011110010000111011011001001110111011 e99c86eba0aae9acb1eba08fe9bdace7b585eba08beba0a1e5a790ed9d98e8ac8ee9acb1eba08fe9bdace7b585eba08beba0a1e5a790ed93bb
UHC 霆렪鬱렏齬絅렋렡姐흘謎鬱렏齬絅렋렡姐퓻 1110111111111101100011101011100011101010101001101000111010100101111001011110000111001100111001111000111010100010100011101011001011101110101110111100100011101010110110101011101011101010101001101000111010100101111001011110000111001100111001111000111010100010100011101011001011101110101110111100011110111111 effd8eb8eaa68ea5e5e1cce78ea28eb2eebbc8eadabaeaa68ea5e5e1cce78ea28eb2eebbc7bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)