To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 伍???ョ????v伍???ョ????vB 10001100110111100011111100111111001111111000001110000111001111110011111100111111001111110111011010001100110111100011111100111111001111111000001110000111001111110011111100111111001111110111011001000010 8cde3f3f3f83873f3f3f3f768cde3f3f3f83873f3f3f3f7642
EUC-JP 伍???ョ????v伍???ョ????vB 10111000111000000011111100111111001111111010010111100111001111110011111100111111001111110111011010111000111000000011111100111111001111111010010111100111001111110011111100111111001111110111011001000010 b8e03f3f3fa5e73f3f3f3f76b8e03f3f3fa5e73f3f3f3f7642
UTF-8 伍곹쓷溜ョ쭅溜욎텋v伍곹쓷溜ョ쭅溜욎텋vB 111001001011110010001101111010101011001110111001111011001001001110110111111011111010011110001011111000111000001110100111111011001010110110000101111011111010011110001011111011001001101010001110111011011000010110001011011101101110010010111100100011011110101010110011101110011110110010010011101101111110111110100111100010111110001110000011101001111110110010101101100001011110111110100111100010111110110010011010100011101110110110000101100010110111011001000010 e4bc8deab3b9ec93b7efa78be383a7ecad85efa78bec9a8eed858b76e4bc8deab3b9ec93b7efa78be383a7ecad85efa78bec9a8eed858b7642
UHC 伍곹쓷溜ョ쭅溜욎텋v伍곹쓷溜ョ쭅溜욎텋vB 111001111110101010000001111011011001110110010100111010101111111010101011111001111010011110000001111010101111111010011110111011001011011010001000011101101110011111101010100000011110110110011101100101001110101011111110101010111110011110100111100000011110101011111110100111101110110010110110100010000111011001000010 e7ea81ed9d94eafeabe7a781eafe9eecb68876e7ea81ed9d94eafeabe7a781eafe9eecb6887642

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear in the table as "?" (hexadecimal 3f).
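
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's actual implementation: the function name `encode_to_bits` and the `CHARSETS` mapping are hypothetical, and the Python codec names `cp932` (for SJIS-Win) and `cp949` (for UHC) are assumed to be the closest equivalents of the charsets listed above. Passing `errors="replace"` is likewise an assumption, chosen because it substitutes "?" (0x3f) for unencodable characters, which resembles the table's output.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def encode_to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" turns unencodable characters into b"?" (0x3f).
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "ョvB"
    for label, codec in CHARSETS.items():
        bits, hexa = encode_to_bits(sample, codec)
        print(label, bits, hexa)
```

For example, `encode_to_bits("ョ", "cp932")` yields the hexadecimal string `8387` and `encode_to_bits("ョ", "euc_jp")` yields `a5e7`, matching the byte pairs for that character in the SJIS-WIN and EUC-JP rows.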