What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????z????z[????z????z[^ 0011111100111111001111110011111101111010001111110011111100111111001111110111101001011011001111110011111100111111001111110111101000111111001111110011111100111111011110100101101101011110 3f3f3f3f7a3f3f3f3f7a5b3f3f3f3f7a3f3f3f3f7a5b5e
SJIS-WIN 娼チ「z娼チ「z[娼チ「z娼チ「z[^ 10001111101010011100000110100010111101111111101101111010100011111010100111000001101000101111011111111011011110100101101110001111101010011100000110100010111101111111101101111010100011111010100111000001101000101111011111111011011110100101101101011110 8fa9c1a2f7fb7a8fa9c1a2f7fb7a5b8fa9c1a2f7fb7a8fa9c1a2f7fb7a5b5e
EUC-JP 娼チ「?z娼チ「?z[娼チ「?z娼チ「?z[^ 1011111010101011100011101100000110001110101000100011111101111010101111101010101110001110110000011000111010100010001111110111101001011011101111101010101110001110110000011000111010100010001111110111101010111110101010111000111011000001100011101010001000111111011110100101101101011110 beab8ec18ea23f7abeab8ec18ea23f7a5bbeab8ec18ea23f7abeab8ec18ea23f7a5b5e
UTF-8 娼チ「z娼チ「z[娼チ「z娼チ「z[^ 11100101101010001011110011101111101111101000000111101111101111011010001011101110100101111001111001111010111001011010100010111100111011111011111010000001111011111011110110100010111011101001011110011110011110100101101111100101101010001011110011101111101111101000000111101111101111011010001011101110100101111001111001111010111001011010100010111100111011111011111010000001111011111011110110100010111011101001011110011110011110100101101101011110 e5a8bcefbe81efbda2ee979e7ae5a8bcefbe81efbda2ee979e7a5be5a8bcefbe81efbda2ee979e7ae5a8bcefbe81efbda2ee979e7a5b5e
UHC 娼???z娼???z[娼???z娼???z[^ 111100111101111000111111001111110011111101111010111100111101111000111111001111110011111101111010010110111111001111011110001111110011111100111111011110101111001111011110001111110011111100111111011110100101101101011110 f3de3f3f3f7af3de3f3f3f7a5bf3de3f3f3f7af3de3f3f3f7a5b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
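
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The function name `encode_to_bits` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "z[^"  # any single character or short string
    for label, codec in CHARSETS.items():
        bits, hexa = encode_to_bits(sample, codec)
        print(label, sample, bits, hexa)
```

For an ASCII-only input such as "z[^", every charset listed yields the same bytes (7a 5b 5e). The rows differ only for characters outside ASCII.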